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Summary of the Invention 
The present invention is based, at least in part, 
on the discovery of cDNA molecules encoding TANGO 191 and 
TANGO 195, both of which are transmembrane proteins. 
5 These proteins, fragments, derivatives, and variants 
thereof are collectively referred to as "polypeptides of 
the invention" or "proteins of the invention." Nucleic 
acid molecules encoding polypeptides of the invention are 
collectively referred to as "nucleic acids of the 

10 invention . " 

The nucleic acids and polypeptides of the present 
invention are useful as modulating agents in regulating a 
variety of cellular processes. Accordingly, in one 
aspect, this invention provides isolated nucleic acid 

15 molecules encoding a polypeptide of the invention or a 
biologically active portion thereof. The present 
invention also provides nucleic acid molecules which are 
suitable as primers or hybridization probes for the 
detection of nucleic acids encoding a polypeptide of the 

20 invention. 

The invention features nucleic acid molecules 
which are at least about 45% (or 55%, 65%, 75%, 35%, 95%, 
or 98%) identical to the nucleotide sequence shown in SEQ 
ID NO: 1, 3, 4, 6 or , or the nucleotide sequence of 

25 the cDNA insert of either the clone deposited with the 
American Type Culture Collection, Manassas, VA (ATCC) as 

Accession Number or the clone deposited with the 

ATCC as Accession Number (the " cDNA of ATCC " 

or the " cDNA of ATCC "), or a complement thereof. 

30 The invention features nucleic acid molecules 

which include a fragment of at least 30C (325, 350, 375, 
400, 425, 450, 500, 550, 600, 650, 70C, 800, 90C, 1000, 
or 1200 ) nucleotides of the nucleotide sequence of SEQ 
ID N0:1, 3, 4, 6 or or the nucleotide sequence of the 
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NOVEL SECRETED IMMUNOMODULATORY PROTEINS 
AND USES THEREOF 


10 Background of the Invention 

Many secreted proteins, for example, cytokines and 
cytokine receptors, play a vital role in the regulation 
of cell growth, cell differentiation, and a variety of 
specific cellular responses. A number of medically 

15 useful proteins, including erythropoietin, granulocyte- 
macrophage colony stimulating factor, human growth 
hormone, and various interleukins , are secreted proteins. 
Thus, an important goal in the design and development of 
new therapies is the identification and characterization 

2 0 of secreted proteins and the genes which encode them. 

Many secreted proteins are receptors which bind a 
ligand and transduce an intracellular signal, leading to 
a variety of cellular responses. The identification and 
characterization of such a receptor enables one to 

25 identify both the ligands which bind to the receptor and 
the intracellular molecules and signal transduction 
pathways associated with the receptor, permitting one to 
identify or design modulators of receptor activity, e.g., 
receptor agonists or antagonists and modulators of signal 

30 transduction. 
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Also within the invention are isolated 
polypeptides or proteins which are encoded by a nucleic 
acid molecule having a nucleotide sequence that is at 
least about 65%, preferably 75%, 85%, or 95% identical to 

5 a nucleic acid sequence encoding SEQ ID NO: 2, 5 or ; 

and isolated polypeptides or proteins which are encoded 
by a nucleic acid molecule having a nucleotide sequence 
which hybridizes under stringent hybridization conditions 
to a nucleic acid molecule having the nucleotide sequence 

10 of SEQ ID NO:l, 3, 4, 6 or a complement thereof or 

the non- coding strand of the cDNA of ATCC or the 

cDNA of ATCC . 

Also within the invention are polypeptides which 
are a naturally occurring allelic variants of a 

15 polypeptide that includes the amino acid sequence of SEQ 

ID N0:2, 5 or an amino acid sequence encoded by the 

cDNA of ATCC , or an amino acid sequence encoded by 

the cDNA of ATCC , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 

20 nucleic acid molecule having the sequence of SEQ ID NO: 

1, 3, 4, 6 or or a complement thereof under stringen- 

conditions. 

The invention also features nucleic acid molecules 
that hybridize under stringent conditions to a nucleic 

25 acid molecule having the nucleotide sequence of SEQ ID 

NO:l ( 3, 4, or 6, the cDNA of ATCC or the cDNA of 

ATCC , or a complement thereof. In other 

embodiments, the nucleic acid molecules are at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 

30 80C. 90C , 1000, or 1290) nucleotides in length and 

hybridizes under stringent conditions to a nucleic acid 
molecule having che nucleotide sequence of SEQ ID N0:1, 

3, 4, 6, or the cDNA ATCC , or the cDNA of ATCC 

, cr a complement thereof. 
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cDNA of ATCC or the cDNA of ATCC , or a 

complement thereof . 

The invention also features nucleic acid molecules 
which include a nucleotide sequence encoding a protein 
5 having an amino acid sequence that is at least about 45% 
(or 55%, 65%, 75%, 85%, 95%, or 98%) identical to the 

amino acid sequence of SEQ ID NO: 2, 5 or or the amino 

acid sequence encoded by the cDNA of ATCC or the 

cDNA of ATCC , or a complement thereof. 

10 In preferred embodiments, the nucleic acid 

molecules have the nucleotide sequence of SEQ ID NO: 1, 

3, 4, 6 or or the nucleotide sequence of the cDNA of 

ATCC , or the cDNA of ATCC . 

Also within the invention are nucleic acid 

15 molecules which encode a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO: 2 or 5 the fragment 
including at least 15 (25, 30, 50, 100, 150, 300, or 400) 
contiguous amino acids of SEQ ID NO: 2 or 5, the 
polypeptide encoded by the cDNA of ATCC , or the 

2 0 polypeptide encoded by the cDNA of ATCC . 

The invention includes nucleic acid molecules 
which encode a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2. 5 or the amino acid sequence encoded bv the 

2 5 cDNA of ATCC , or the amino acid sequence encoded by 

the cDNA of ATCC , wherein the nucleic acid molecule 

hybridizes to a nucleic acid molecule having a nucleic 

acid sequence encoding SEQ ID NO: 2, 5 or or a 

complement thereof under stringent conditions. 

30 Also within the invention are isolated 

polypeptides or proteins having an amino acid sequence 
that is at least about 65% preferably 75%, 85%, 95%, or 
98% identical to the amino acid sequence of SEQ ID NO: 2, 
5 or 
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polypeptide; (2) the ability to bind a iigand of the 
naturally-occurring polypeptide; (3) the ability to bind 
to an intracellular target of the naturally-occurring 
polypeptide. Other activities include: (l) the ability 
5 to modulate cellular proliferation; (2) the ability to 
modulate cellular differentiation; and (3) the ability to 
modulate cell death. 

In one embodiment, a polypeptide of the invention 
has an amino acid sequence sufficiently identical to at 

10 least one domain of a polypeptide of the invention. As 
used herein, the term "sufficiently identical" refers to 
a first amino acid or nucleotide sequence which contains 
a sufficient or minimum number of identical or equivalent 
(e.g., with a similar side chain) amino acid residues or 

15 nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide 
sequences have a common structural domain and/or common 
functional activity. For example, amino acid or 
nucleotide sequences which contain a common structural 

20 domain having about 65% identity, preferably 75% 

identity, more preferably 85%, 95%, or 98% identity are 
defined herein as sufficiently identical. 

In one embodiment, the isolated polypeptide lacks 
both a transmembrane and a cytoplasmic domain. In 

2 5 another embodiment the polypeptide lacks both a 

transmembrane domain and a cytoplasmic domain and is 
soluble under physiological conditions. 

The polypeptides of the present invention, or 
biologically active portions thereof, can be operably 

3 0 linked to a heterologous amino acid sequence to form a 

fusion protein. The invention further features 
antibodies that specifically bind a polypeptide of the 
invention such as monoclonal or polyclonal antibodies. 
In addition, the polypeptides of the invention or 
35 biologically active portions thereof can be incorporated 
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In preferred embodiments, the isolated nucleic 
acid molecules encode a cytoplasmic (SEQ ID NO: 11 or 16) , 
transmembrane (SEQ ID NO:10 or 15), or extracellular (SEQ 
ID NO: 9 or 14) domain of a polypeptide of the invention 
5 or a complement thereof. In another embodiment, the 
invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a nucleic acid 
of the invention. 

Another aspect of the invention provides vectors, 

10 e.g., recombinant expression vectors, comprising a 
nucleic acid molecule of the invention. In another 
embodiment, the invention provides host cells containing 
such a vector. The invention also provides methods for 
producing a polypeptide of the invention by culturing, in 

15 a suitable medium, a host cell of the invention 

containing a recombinant expression vector such that a 
the polypeptide is produced. 

Another aspect of this invention features isolated 
or recombinant proteins and polypeptides of the 

20 invention. Preferred proteins and polypeptides possess 
at least one biological activity possessed by the 
corresponding naturally-occurring human polypeptide. An 
activity, a biological activity, and a functional 
activity of a polypeptide of the invention refers to an 

25 activity exerted by a protein, polypeptide or nucleic 
acid molecule of the invention on, for example, a 
responsive cell as determined in vivo, or in vitro, 
according to standard techniques. Such activities can be 
a direct activity, such as an association with or an 

30 enzymatic activity on a second protein or an indirect 

activity, such as a cellular signaling activity mediated 
by interaction of the protein with a second protein. 
Thus, such activities include, e.g., (i) the ability to 
form protein: protein interactions with proteins in the 

3 5 signaling pathway of the naturally-occurring 
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nucleic acid of the invention. In other embodiments, the 
modulator is a peptide, peptidomimetic , or other small 
molecule . 

The present invention also provides diagnostic 
5 assays for identifying the presence or absence of a 

genetic lesion or mutation characterized by at least one 
of: (i) aberrant modification or mutation of a gene 
encoding a polypeptide of the invention, (ii) mis- 
regulation of a gene encoding a polypeptide of the 

10 invention, and (iii) aberrant post-translational 

modification of a protein of the invention wherein a 
wild- type form of the gene encodes a protein having the 
activity of the protein of the invention. 

In another aspect, the invention provides a 

15 method fcr identifying a compound that binds to or 

modulates the activity of a polypeptide of the invention. 
In general, such methods entail measuring a biological 
activity of the polypeptide in the presence and absence 
of a test compound and identifying those compounds which 

20 alter the activity of the polypeptide. 

The invention also features methods for 
identifying a compound which modulates the expression of 
a polypeptide or nucleic acid of the invention by 
measuring the expression of the polypeptide or nucleic 

25 acid in the presence and absence of the compound. 

Other features and advantages of the invention 
will be apparent from the following detailed description 
and claims. 


Brief Description of the Drawings 
30 Figure 1 depicts the cDNA sequence (SEQ ID NO:l) 

and predicted amino acid sequence (SEQ ID NO: 2) of human 
TANGC 1S1. The open reading frame of SEQ ID NO:l extends 
from nucleotide 557 to 2353, inclusive (SEQ ID NO:3). 
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into pharmaceutical compositions, which optionally 
include pharmaceutical^ acceptable carriers. 

In another aspect, the present invention provides 
methods for detecting the presence of the activity or 
5 expression of a polypeptide of the invention in a 

biological sample by contacting the biological sample 
with an agent capable of detecting an indicator of 
activity such that the presence of activity is detected 
in the biological sample. 

10 In another aspect, the invention provides methods 

for modulating activity of a polypeptide of the invention 
comprising contacting a cell with an agent that modulates 
(inhibits or stimulates) the activity or expression of a 
polypeptide of the invention such that activity or 

15 expression in the cell is modulated. In one embodiment, 
the agent is an antibody that specifically binds to a 
polypeptide of the invention. 

In another embodiment, the agent modulates 
expression of a polypeptide of the invention by 

20 modulating, transcription, splicing, or translation of an 
mRNA encoding a polypeptide of the invention. In yet 
another embodiment, the agent is a nucleic acid molecule 
having a nucleotide sequence that is antisense cc the 
coding strand of an mRNA encoding a polypeptide of the 

2 5 invention. 

The present invention also provides methods for 
treating a subject having a disorder characterized by 
aberrant activity of a polypeptide of the invention or 
aberrant expression of a nucleic acid or polypeptide of 

3 0 the invention by administering an agent which is a 

modulator of the activity of a polypeptide of the 
invention or a modulator of the expression of a nucleic 
acid or polypeptide of the invention to the subject. In 
one embodiment, the modulator is a protein of the 
3 5 invent ion. In another embodiment, the modulator is a 
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Detailed Description of the Invention 
The present invention is based on the discovery of 
cDNA molecules encoding TANGO 191 and TANGO 195, both of 
which are secreted proteins. The subsections and Tables 
5 summarize certain features of TANGO 191 and TANGO 195. 


TANGO 191 

The human TANGO 191 cDNA of SEQ ID NO:l has a 1797 
nucleotide open reading frame (SEQ ID NO: 3) encoding a 
599 amino acid protein (SEQ ID NO: 2) . The cDNA and 

10 protein sequences of human TANGO 191 are shown in Figure 
1. This cDNA was isolated from a human mixed lymphocyte 
reaction library based on its sequence similarity to 
genes encoding certain members of the interleukin-1 (IL- 
1) receptor super family. 

15 Human TANGO 191 is a transmembrane protein having 

' a 19 amino acid signal sequence (amino acids 1 - 19 of 
SEQ ID NO: 2; SEQ ID NO: 7) followed by a 580 amino acid 
mature protein (amino acids 20 - 599 of SEQ ID NO:2; SEQ 
ID NO : 8 ) . Mature TANGO 191 is predicted to have a 

20 transmembrane domain that extends from amino acid 3 58 to 
amino acid 382 of SEQ ID NO:2 (SEQ ID NO:10), an 
extracellular domain that extends from amino acid 20 to 
amino acid 357 of SEQ ID NO:2 (SEQ ID NO:9), and a 
cytoplamic domain extending from amino acid 383 to amino 

25 acid 599 of SEQ ID NO:2 (SEQ ID NO: 11) . 

TANGO 191 has a molecular weight of 68.3 kDa prior 
to cleavage of its signal peptide and a molecular weight 
of 66.1 kDa after cleavage of its signal peptide. 

TANGO 191 has four potential N-glycosylat ion sites 

30 (amino acids 21-24, 119-122, 152-155, and 345-248 of SEQ 
ID NO:2); 15 potential protein kinase C phosphorylation 
sites (amino acids 26-28, 35-37, 63-65, 160-162, 203-205, 
233-235, 272-275, 307- 309, 311-313, 327-329, 474-476, 
506-508, 538-540, 575-577, and 590-592 of SEQ ID NO:2); 
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Figure 2 is a hydropathy plot of TANGO 191. 
Relative hydrophobici ty is shown above the line marked 
"0"; and relative hydrophilicity is shown below the line 
marked H C". 

5 Figure 3 depicts the cDNA sequence (SEQ ID NO:4) 

and predicted amino acid sequence (SEQ ID NO: 5) of a 
partial human TANGO 195 clone. The open reading frame of 
SEQ ID NO: 4 extends from nucleotide 166 to 1101, 
inclusive (SEQ ID NO: 6) . 
10 Figure 4 is a hydropathy plot of TANGO 195. 

Relative hydrophobicity is shown above the line marked 
"0", and relative hydrophilicity is shown below the line 
marked H 0 " . 

Figure 5 depicts an alignment of the amino acid 
15 sequences of human TANGO 195 (SEQ ID NO: 5) and human SLAM 
(SEQ ID NO: 20) . In this alignment the sequences are 
22,8% identical overall. 

Figure 6 depicts an alignment of portions of TANGO 
191 with PF00047, an IG superfamily domain HMM (SEQ ID 
20 NOs :21, 22, and 23) . 

Figure 7 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ; of 

murine TANGO 195. The open reading frame of SEQ ID 

NO: extends from nucleotide 42 to 875, inclusive (SEQ 

25 ID NO: ) . 

Figure 8 depicts the cDNA sequence (SEQ ID NO: ; 

and predicted amino acid sequence (SEQ ID NO: ) of a 

partial human TANGO 195 clone (T195Athpb93f 1) . The open 
reading frame extends from nucleotide 159 to 1118, 

30 inclusive (SEQ ID NO: ) . 

Figure 9 depiccs the cDNA sequence (SEQ ID NO: 

and predicted amino acid sequence (SEQ ID NO: '; of a 

full length TANGO 195 clone (T195AthLal70f 10) . The open 
reading frame extends from nucleotide 25 to nucleotide 
35 879, inclusive (SEQ ID NO: ). 
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which engages a protein complex that acts to activate NK- 
yB. Members of the NK-yB family regulate many of immune 
and inflammatory genes that are induced by IL-l. 

Since TANGO 191 has some similarity to IL-l 
5 receptor, TANGO 191 nucleic acids and polypeptides as 
well as modulators of TANGO 191 expression or activity 
are useful in the treatment of a variety of immune and 
inflammatory disorders, e.g., asthma, graft vs-host 
disease, rheumatoid arthritis, psoriasis, inflammatory 
10 bowel disease, septic shock, ulcerative colitis, Crohn's 
disease, chronic myelogenous leukemia, cancer, liver 
disease, Hodgkin's disease., osteoarthritis, Lyme disease, 
cachexia, and autoimmune diseases, e.g., myasthenia 
gravis, autoimmune diabetes, and lupus. 

15 TANGO 195 

The human TANGO 195 partial cDNA of SEQ ID NO: 4 
has a 93 6 nucleotide open reading frame (SEQ ID NO: 6) 
encoding a 312 amino acid protein (SEQ ID NO: 5) . The 
cDNA and protein sequences of human TANGO 195 clone are 

20 shown in Figure 3. This partial TANGO 195 cDNA clone was 
isolated from a human mixed lymphocyte reaction library 
based or. its homology to signalling lymphocyte activation 
marker (Cocks et al . (1995) Nature 376:260-63). Apparen- 
full- length clones (3.0 kb and 1.3 kb) were isolated from 
.25 the same library and a human mid- term placental library. 

The portion of human TANGO 195 encoded by the cDNA 
of SEQ ID NO: 4 is a transmembrane secreted protein having 
a 22 amino acid signal sequence (amino acids 1 - 22 of 
SEQ ID NO: 5; SEQ ID NO: 12) followed by a 2 91 amino acid 

30 mature protein (amino acids 23 - 312 of SEQ ID NO: 5; SEQ 
ID NO: 13 ; . The portion cf mature TANGO 195 encoded by 
the cDNA cf SEQ ID NO: 4 is predicted to have a 
transmembrane domain that extends from amino acid 23 4 to 
amine acid 254 of SEQ ID NO:5 (SEQ ID NO:15), an 
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12 potential casein kinase II phosphorylation sites 
(amino acids 36-39, 89-92, 133-136, 224-227, 294-297, 
301-304, 311-314, 327-330, 401-404, 427-430, 490-493, and 
585-588 of SEQ ID NO : 2 ) ; one potential tyrosine kinase 
5 phosphorylation site (amino acids 205-212 of SEQ ID 
NO: 2); and six potential N-myristoylation sites (amino 
acids 117-122, 168-173, 217-222, 366-371, 460-465, and 
583-588 of SEQ ID NO:2) . 

Figure 2 is a hydropathy plot of TANGO 191. 
10 Relative hydrophobicity is shown above the line marked 
"0", and relative hydrophilicity is shown below the line 
marked " 0 " . 

Northern analysis of human TANGO 191 mRNA 
expression revealed that it is expressed in spleen, lymph 
15 node, peripheral blood lymphocytes, and bone marrow. 

A clone (EPftX191a) containing a cDNA encoding 
TANGO 191 inserted into pZL-1 (GIBCO/BRL; Bethesda, MD) 
between the NotI and Sail sites was deposited with the 
American Type Culture Collection, Manassas, VA on 

20 September 9, 1998, and assigned Accession Number . 

Human TANGO 191 appears to be a member of the IL-1 
receptor superfamily. TANGO 191 includes three regions 
(amino acids 71-123 of SEQ ID N0:2; SEQ ID NO:17; ; amino 
aicds 168-223 of SEQ ID NO:2; SEQ ID NO:18); amino acids 
25 266-339 of SEQ ID NO : 2 ; SEQ ID NO: 19) which have homology 
to the IG superfamily domain (PF00047) that is 
characteristic of members of the IL-1 superfamily (Figure 
6) . 

IL-1 receptor (IL-1R) plays a critical role the 
30 regulation of immune and inflammatory responses. 

Signalling by IL-1R requires that IL-1R form a complex 
with IL-lAcP, a protein which may be required for 
internalization of IL-1R. It is thought that both IL-1R 
and IL-lAcP interact with IRAK- 2. It has been proposed 
35 that this multiprotein complex interacts with TRAF6 , 
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full length TANGO 195 clone (T195AthLal70f 10 ) . The open 
reading frame extends from nucleotide 2 5 to nucleotide 

879, inclusive (SEQ ID NO: ) . 

The full-length TANGO 195 protein of Figure 9 is 
5 predicted to be a transmembrane protein having a 22 amino 
acid signal sequence (amino acids 1-22 of SEQ ID NO: 

SEQ ID NO: ) followed by a 2 63 amino acid mature 

protein (amino acids 23-285 of SEQ ID NO: ) . This form 

of TANGO 195 is predicted to have a transmembrane domain 

10 extending from amino acid 234 to amino acid 254 of SEQ ID 

NO: (SEQ ID NO: ), an extracellular domain extending 

from amino acid 23 to amino acid 233 of SEQ IE} NO: 

(SEQ ID NO: ) and a cytoplasmic domain that extends 

from amino acid 255 to amino acid 28 5 of SEQ ID NO: 

15 (SEQ ID NO: ) . 

In situ expression analysis of TANGO 195 in adult 
mice revealed expression in the spleen (mutlifocal 
expression with expression highest in follicles) , thymus 
(multifocal expression) , and lymph node (multifocal 

20 expression) . No expression was detected in lung and 
stomach. In situ expression analysis was also used to 
examine expression in the spleens of adult mice 1 , 3, 5, 
and 14 post -immunization with EFA/PBS. In each case 
multifocal expression was observed with expression being 

25 highest in the follicles. The expression at 14 days 
post -immunization was somewhat lower than in at other 
time points. 

Northern analysis of TANGO 195 expression revealed 
the presence of a 1.8 kb transcript and a 3.4 kb 

30 transcript in spleen, lymph node and thymus and a 1.8 kb 
transcript was observed in bone marrow with expression 
being highest in lymph node. Additional Northern 
analysis revealed expression in the following tissues (in 
decreasing order of expression) : lymph node, stomach, 

35 small intestine, appendix, lung, spleen, and bone marrow. 
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extracellular domain that extends from amino acid 23 to 
amino acid 233 of SEQ ID NO:5 (SEQ ID NO:14), and a 
cytoplamic domain extending from amino acid 255 to amino 
acid 312 of SEQ ID NO:5 (SEQ ID NO:16). 
5 TANGO 195 is a type I transmembrane protein 

belonging to the CD 2 subgroup of the immunoglobulin 
superf amily . 

The TANGO 195 of SEQ ID NO : 5 has three potential 
N-glycosylation sites (amino acids 85-88, 100-103, and 

10 156-159 of SEQ ID NO:5); three potential protein kinase C 
phosphorylation sites (amino acids 163-165, 230-232, and 
308-310 cf SEQ ID NO:5); three potential casein kinase II 
phosphorylation sites (amino acids 168-171, 215-218, and 
230-233 cf SEQ ID NO:5); one potential tyrosine kinase 

15 phosphorylation site (amino acids 65-72 of SEQ ID NO:5); 
one potential cGMP- dependent protein kinase 
phosphorylation site (amino acids 165-168 of SEQ ID 
NO: 5); and three potential N-myristoylation sites (amino 
acids 66-71, 110-115, and 183-188 of SEQ ID NO:5) . 

20 Figure 4 is a hydropathy plot of the TANGO 195 of 

SEQ ID NO: 5. Relative hydrophobicity is shown above the 
line marked "0", and relative hydrophilicity is. shown 
belov; the line marked "0". 

Figure 7 depicts the cDNA sequence (SEQ ID NO: ; 

2 5 and predicted amino acid sequence (SEQ ID NO: } of 

murine TANGO 195. The open reading frame of SEQ ID 

NO: extends from nucleotide 42 to 876, inclusive (SEQ 

ID NO: ) . 

Figure 8 depicts the cDNA sequence (SEQ ID NO: ) 

3 0 and predicted amino acid sequence (SEQ ID NO: } of a 

parcial human TANGO 155 clone (T195Athpb93f 1 ) . The open 
reading frame 0 excends from nucleotide 159 to 1116. 

inclusive (SEQ ID NO: ) . 

Figure 9 depicts the cDNA sequence (SEQ ID NO: ) 

35 and predicted amino acid sequence (SEQ ID NO: ) of a 
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hours) . The strongest expression was seen in monocytes 
activated with gIFN (0.2 or 2ug/ml) giving 2 bands of 
approximately 1.5-2 kb and 3-3.5 kb (with the upper band 
stronger) . Addition of LPS did not increase expression 
5 in monocytes or PBMCs and may in fact lead to a slight 
decrease in expression in gIFN stimulated monocytes. 

TANGO 195 function was investigated by 
reconstituting irradiated mice with bone marrow cells 
infected with retrovirus expressing either full length 

10 murine TANG0195 ("T195fl") or the extracellular domain of 
murine TANGO 195 ( ,, T195ex" ) . The donor and recipient 
mice were C57BL/6 and congenic for CD45 (CD45.1 for 
donor, CD4 5.2 for recipient). Recipient mice were 
analyzed approximately 10 and 14 weeks after 

15 transplantation for blood chemistry, hematology and 

tissue histology. Peripheral blood, spleen, lymph node 
and thymus cells were analyzed by FACs analysis and TANGO 
195 RNA levels were analyzed in spleen. The percentage of 
infected cells, based on the percentage of G418-resistant 

20 donor cells, was 54% (T195fl) and 69% (T195ex) . The level 
of TANGO 195 RNA expression in the spleen of. recipient 
mice, based upon slot blot analysis (GAPDH control) , was 
10 cimes that of control mice (328% of GAPDH for T195ex 
and 232% of GAPDH for T195fi compared to 33% of GAPDH in 

25 control mice) . 

Expression of T195ex led to a statistically 
significant increase in triglyceride levels (ave = 97.6 
mg/dl at 12 weeks; ave = 94.3 mg/dl at 16 weeks) in serum 
as compared to control mice (ave = 52.5 mg/dl at 12 week; 

30 ave = 63.1 mg/dl 16 weeks) or mice expressing T195fl (ave 
= 47.9 mg/dl at 12 weeks; ave = 67.5 mg/dl at 16 weeks). 


Expression of T195fl had a variety of effects on 
lymphocytes. In T195fl expressing mice, FACs analysis of 
peripheral blood showed an increase in total B220hi 
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However, a probe based on the open reading frame did not 
detect expression in lung or stomach. 

Additional Northern analysis revealed that TANGO 
195 is expressed in activated human monocytes/macrophages 
5 and, at lower level, in activated human lymphocytes. 
This analysis also revealed that cytokine induced 
differentiation of T cells appears to regulate TANGO 195 
expression. To carry out this analysis, PBMCs were 
purified from human buffy coat by ficol gradient 

10 ce-ntrifugation as per manuf acuurers instructions (Sigma) . 
PBMCs were activated for 24 hours with lug/ml LPS. Using 
Miltenyi Biotech positive selection beads, CD4+, CD8+ and 
CD19+ cells were isolated from resting PBMCs. The CD8+ 
cells were activated for 24 hours using plate bound anti- 

15 CDS. Resting monocytes were purified from PBMCs by 

gradient centrif ugation and stimulated with 0, 0.01, 0.1 
or lug/ml LPS either with or without 2ng/ml gIFN for 4 
hours or with 0, 0.2 or 2 ng/ml gIFN for 4 hours. 
Activated T cells (predominantly CD4+ cells) were 

20 prepared from PBMCs by stimulation with plate bound anti- 
CD3 (TR66) for 3 days in RPMI 10% FBS . Cells were then 
expanded without: anti-CD3 but in the presence of IL2 for 
22 days. Cells were resuspended at 107/mi in fresh RPMI 
10% FBS with no exogenous cytokines, or IL10 (20ng/ml) 

25 plus IL4 (40ng/ml) or TNFa (5,000u/ml) plus gIFN 

(15ng/ml) . (All cytokines were purchased from Genzyme . ) 
Cells were harvested after 8 and 24 hours. RNA was 
prepared from all cell types using RNeasy mini kit 
(Qiagen) and expression was analyzed by standard Northern 

30 analysis using approx lOug total RNA. No expression was 
seen in resting leukocytes PBMCs, CD4+, CD8+, CD19 + or 
monocytes. A single band of approximately 3-3.5 kb was 
seen in activated CD8+ cells and in T cells activated ' 
with no exogenous cytokines or the combination of IL10 

35 and IL4, but not with TNFa and gIFN (after both 3 and 24 
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These results also suggest that TANGO 195 may play 
a role in B cell leukemia, immune response, and 
autoimmune disorders (e.g., arthritis). 

TANGO 195 maps to human chromosome locus hulq21. 
5 The flanking markers are AFMA323ZE5 and D1S2635. The 
among identified loci in close proximity to TANGO 195 are 
HYPLP1 ( hyper lipidemial) and LPD1 (lipodystrophy) . 
Nearby known human genes include: SPTA1 (spectrin, 
alpha), THBS3 { thrombospondin 3), MTX (metaxin) , CTSS 

10 (cathepsin K,S), FLG (filaggrin) , PKLR (pyruvate kinase), 
HYPLIP1 (hyperlipidemia) . 

The mouse chromosome corresponding to the human 
chromosomal locus is chromosome 3. Nearby mouse loci 
include: soc (soft coat), hyplipl (hyperlipidemia), ft 

15 (flaky tail) and ma (matted) . Nearby mapped mouse genes 
include: Imna (lamin A), fig (filaggrin), bean 
(brevican) , gba (acid beta glucosidase) . 

Rabbit polyclonal antibodies were raised against 
three peptides from murine TANGO 195 peptides to amino 

20 acids 26-34, 102-117 and 161-176. Peptide purified sera 
from rabbits immunized with amino acids 102-117 
specifically recognizes mouse T195-hFc protein by 
scandarad Western Blot. Additionally unpurified sera 
from rabbits immunized with amino acids 102-117 recognize 

25 mouse T195-hFc by ELISA. ELISA plates were coated with 5 
Tg/ml mouse T195-hFc or human lg control in PBS overnight 
at 4°C. Plates were washed and blocked with PBS 1% PSA. 
Serial dilutions of serum were added and incubated for 
approximately 2 hours at room temperature. Plates were 

30 washed and rabbit immunoglobulin detected with anti- 
rabbit lg-HRP. Serum from rabbits immunized with the 
peptide corresponding to amino acids 102-117 showed 
greater than 20 fold higher titres against mouse T195-hFc 
compared to human Ig, and showed greater than 20 fold 
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(CD45Rhi) , IgD+ B cells compared to control mice or mice 
expressing T195ex. The B220hi (CD45Rhi) , IgD+ B cells 
showed low levels of expression of Macl which is 
generally regarded as a marker for cells of the 
5 monocyte/macrophage lineage. FACs analysis of peripheral 
blood showed a slight decrease in CD4+ T cells in T195fl 
expressing mice compared to control mice and T195ex 
expressing mice. FACs analysis of spleen and lymph node 
cells showed a similar increase in B220 + IgD+ Macllo 

10 cells. These cells had similar levels of surface IgD and 
IgM compared to B220+ cells from control and T195ex mice. 
CD45.1 staining confirmed that the B220 + IgD+ Macllo 
cells in spl een , lymph nodes and peripheral blood were 
derived from donor bone marrow. 

15 In T195fl expressing mice, FACs analysis of 

peritoneal lavage cells showed an increase in total B220- 
cells, an increase in B220+ CD231o cells, and an increase 
in B220+ Macllo cells compared to control mice of T195ex 
expressing mice. B220 + cells showed slightly lower 

20 levels of IgD compared to B220 + cells from control mice 
or T195ex expressing mice. In addition, B220+ CD231o 
cells in T195fl expressing mice showed significantly 
lower expression of B220 compared to control and T195ex 
mice. Finally, it was observed that very few CD5^ B220- 

25 were seen on peritoneal lavage from T195fl, T195ecd or 
control mice. 

Taken together, these results suggest that 
increased expression of T195fl in bone marrow derived 
cells leads to an increase in Bl like cells in the 

30 periphery. Since Bl cells are not normally bone marrow 
derived the exact lineage of the cells is difficult to 
determine however they appear to be Bib cells. These 
results also suggest that increased expression of T195fl 
on bone marrow derived cells may lead to a decrease in 

35 CD4 + T cells in the periphery. 
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TANGO 195 expression or activity may be useful in the 
treatment of diorders associated with abberrant B cell 
expansion or differentiation or abberrant cytokine 
production, e.g., allergic and autoimmune disorders. 

5 TABLE 1: Summary of Human TANGO 191 and TANGO 195 


Sequence Information 


Gene 

CDNA 

ORF 

Protein 

Figure 

Accession 
No. 

TANGO 19- 

SEQ ID NO:i 

SEQ ID NO: 3 

SEQ ID NO: 2 

Fig. l 


TANGO 1S5 
(partial) 

SEQ ID NO: 4 

SEQ ID NO: 6 

SEQ ID NO: 5 

Fig. 2 


TANGO 195 
(partial » 

SEQ ID NO:_ 

SEQ ID NO: 

SEQ ID NO: 

Fig. 8 


TANGO 195 
full-length 

SEQ ID NO: 

SEQ ID NO: 

SEQ ID NO: 

Fig. 9 



15 TABLE 2: Summary of Domains of TANGO 191 and TANGO 195 



Protein 

Signal 

Mature 

Extraceliula 

Transmemorane 

Cytcciasmic 



Sequence 

Protein 

TT 

Domain 

Domain 





Domain 




TANGC 191 

aa 1-19 

aa 20-599 

aa 20-357 

aa 358-382 

aa jc3-599 


SEQ ID NO: 2 

SEQ ID NO: 7 

SEQ ID NO: 8 

SEQ ID NO: 9 

SEQ ID NO: 10 

S$Q ID NO: 11 


TANGO 195 

aa 1-22 

aa 23-312 

aa 20-233 

aa 234-254 

aa 255-312 

20 

SEQ ID NO: 5 

SEQ ID 

SEQ ID NO: 13 

SEQ ID NO: 14 

SEQ ID NO: 15 

SEQ ID NO: 16 



NO: 12 






Various aspects of the invention are described in 
further detail in the following subsections 
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higher titres against mouse T195-hFc compared to control 
serum. 

Several TANGO 195/immunoglobulin constant region 
fusion proteins were created. Using human TANGO 195 a 
5 fusion protein consisting of TANGO 195 (aal-233)- 
AAPGGAS YKD - human IgGlfc was created. A second human 
TANGO 195 fusion substituted murine IgGlfc for human 
IgGlfc. Using murine TANGO 195 a fusion protein 
consisting of TANGO 195 (aal to 231) -AASGKASYKD- human 

10 IgGlfc was created. A second murine TANGO 195 fusion 
protein substituted murine IgGlfc for human IgGlfc. 

A clone (Epj thPb093fol) containing a 1.3 kb cDNA 
encoding apparently full-length TANGO 19 5 in pMET7 
between NotI and Sail was deposited with the American 

15 Type Culture Collection, Manassas, VA on September 9, 

1998, and assigned Accession Number. . 

TANGO 195 has regions that are significantly 
similar to human signalling lymphocyte activation 
molecule ("SLAM") (Accession Number U33017) . For 

20 example, the region of TANGO 195 from amino acid 173 to 
amino acid 250 has 32% identity (25/78 amino aicds) and 
50% identity (39/5 9 amino acids) to the corresponding 
region of SLAM; the region of TANGO 195 from amino acid 
134 to amino acid 164 has 32% identity (10/31 amino 

25 acids) and 41% identity (13/31 amino acids) to the 

corresponding region of SLAM; and the region of TANGO 195 
from amino acid 117 to amino acid 132 has 43% identity 
(7/16 amino acids) and 75% identity (12/16 amino acids) 
to the corresponding region of SLAM (Figure 5) . 

3 0 SLAM is thought to enhance the expansion and 

differentiation of activated B cells (Punnonen et al . 

(1997) J. Exp. Med. 185:993-1004) and in the regulation 
of cype 1 and type 2 cytokine production (Ferrante et al . 

(1998) J. Immunology 160:1514-21). TANGO 195 likely has 
35 a function similar to that of SLAM. Thus, modulators of 
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A nucleic acid molecule of the present invention, 
e.g., a nucleic acid molecule having the nucleotide 
sequence of SEQ ID NO:l, 3, 4, or G, the cDNA of ATCC 

, or the cDNA of ATCC , or a complement 

5 thereof, can be isolated using standard molecular biology 
techniques and the sequence information provided herein. 
Using all or a portion of the nucleic acid sequences of 

SEQ ID NO: 1, 3, 4, or 6 , the cDNA of ATCC , or the 

cDNA of ATCC as a hybridization probe, nucleic 

10 acid molecules of the invention can be isolated using 
standard hybridization and cloning techniques (e.g., as 
described in Sambrook et al . , eds., Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold 

15 Spring Harbor, NY, 1989) . 

A nucleic acid molecule of the invention can be 
amplified using cDNA, mRNA or genomic DNA as a template 
and appropriate oligonucleotide primers according to 
standard PCR amplification techniques. The nucleic acid 

20 so amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a 
nucleic acid molecule of the invention can be prepared by 
standard synthetic techniques, e.g., using an automated 

25 DNA synthesizer. 

In another preferred embodiment, an isolated nucleic 
acid molecule of the invention comprises a nucleic acid 
molecule which is a complement of the nucleotide sequence 
of SEQ ID NO:l, 3, 4, or 6 , the cDNA of ATCC , or 

30 the cDNA of ATCC , or a portion thereof. A nucleic 

acid molecule which is complementary to a given 
nucleotide sequence is one which is sufficiently 
complementary to the given nucleotide sequence that it 
can hybridize to the given nucleotide sequence thereby 

3 5 forming a stable duplex. 
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I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode a polypeptide of the 
invention or a biologically active portion thereof, as 
5 well as nucleic acid molecules sufficient for use as 
hybridization probes to identify nucleic acid molecules 
encoding a polypeptide of the invention and fragments of 
such nucleic acid molecules suitable for use as PCR 
primers for the amplification or mutation of nucleic acid 

10 molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., 
cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and 
analogs of the DNA or RNA generated using nucleotide 
analogs. The nucleic acid molecule can be single- 

15 stranded or double-stranded, but preferably is double- 
stranded DNA. 

An "isolated" nucleic acid molecule is one which is 
separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid 
20 molecule. Preferably, an "isolated" nucleic acid 
molecule is free of sequences (preferably protein 
encoding sequences) which naturally flank the nucleic 
acid (i.e., sequences located at the 5' and 3' ends of 
the nucleic acid) in the genomic DNA of the organism from 

2 5 which the nucleic acid is derived. For example, in 

various embodiments, the isolated nucleic acid molecule 
can contain less than about 5kB, 4 kB, 3kB, 2kB, lkB, 
0.5 kB or 0.1 kB of nucleotide sequences which naturally 
flank the nucleic acid molecule in genomic DNA of the 

3 0 cell from which the nucleic acid is derived. Moreover, 

an "isolated" nucleic acid molecule, such as a cDNA 
molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors 
35 or other chemicals when chemically synthesized. 
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A nucleic acid fragment encoding a "biologically- 
active portion" of a polypeptide of the invention can be 
prepared by isolating a portion of any of SEQ ID NO: 3 or 

6, the nucleotide sequence of the cDNA of ATCC , or 

5 the nucleotide sequence of the cDNA of ATCC which 

encodes a polypeptide having a biological activity, 
expressing the encoded portion of the polypeptide protein 
(e.g., by recombinant expression in vitro) and assessing 
the activity of the encoded portion of the polypeptide. 

10 The invention further encompasses nucleic acid 

molecules that differ from the nucleotide sequence of SEQ 

ID NO:l, 3, 4, or 6, the cDNA of ATCC , or the cDNA 

of ATCC due to degeneracy of the genetic code and 

thus encode the same protein as that encoded by the 

15 nucleotide sequence of SEQ ID NO : 1 , 3, 4, or 6, the cDNA 

of ATCC , or the cDNA of ATCC . 

In addition to the nucleotide sequences shown in SEQ 

ID NO: 3 and 6 and present in the cDNA of ATCC and 

the cDNA of ATCC , it will be appreciated by those 

2 0 skilled in the art that DNA sequence polymorphisms that 

lead to changes in the amino acid sequence may exist 
within a population (e.g., the human population). Such 
genetic polymorphisms may exist among individuals within 
a population due to natural allelic variation. An allele 

25 is one of a group of genes which occur alternatively at a 
given genetic locus. As used herein, the phrase "allelic 
variant" refers to a nucleotide sequence which occurs at 
a given locus or to a polypeptide encoded by the 
nucleotide sequence. As used herein, the terms "gene" 

30 and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a polypeptide 
of the invention. Such natural allelic variations can 
typically result in 1-5% variance in the nucleotide 
sequence of a given gene. Alternative alleles can be 

3 5 identified by sequencing the gene of interest in a number 
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Moreover, a nucleic acid molecule of the invention 
can comprise only a portion of a nucleic acid sequence 
encoding a full length polypeptide of the invention for 
example, a fragment which can be used as a probe or 
5 primer or a fragment encoding a biologically active 

portion of a polypeptide of the invention. The nucleotide 
sequence determined from the cloning one gene allows for 
the generation of probes and primers designed for use in 
identifying and/or cloning homologues in other cell 

10 types, e.g., from other tissues, as well as homologues 
from other mammals. The probe/primer typically comprises 
substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent 

15 conditions to at least about 12, preferably about 25, 
more preferably about 50, 75, 100, 125, 150, 175, 200, 
250, 300, 350 or 400 consecutive nucleotides of the sense 
or anti- sense strand of SEQ ID NO:l, 3, 4, or 6, the cDNA 
ATCC , or the cDNA of ATCC or of a naturally 

2 0 occurring mutant of SEQ NO:l, 3, 4, or 6, the cDNA of 

ATCC , or the cDNA of ATCC . 

Probes based on the sequence of a nucleic acid 
molecule of the invention can be used to detect 
transcripts or genomic sequences encoding the same 

25 protein molecule encoded by a selected nucleic acid 
molecule. The probe comprises a label group attached 
thereto, e.g., a radioisotope, a fluorescent compound, an 
enzyme, or an enzyme co- factor. Such probes can be used 
as part of a diagnostic test kit for identifying cells or 

30 tissues which mis-express the protein, such as by 

measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., 
detecting mRNA levels or determining whether a gene 
encoding the protein has been mutated or deleted. 
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As used herein, the term "hybridizes under stringent 
conditions'* is intended to describe conditions for 
hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 
5 identical to each other typically remain hybridized to 
each other. Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 
in Molecular Biology, John Wiley & Sons, N.Y. (1989) , 
6.3.1-6.3.6. A preferred, non-limiting example of 

10 stringent hybridization conditions are hybridization in 
6X sodium chloride/sodium citrate {SSC) at about 45°C, 
followed by one or more washes in 0.2 X SSC, 0.1% SDS at 
50-65°C. Preferably, an isolated nucleic acid molecule 
of the invention that hybridizes under stringent 

15 conditions to the sequence of SEQ ID NO:l, 3, 4 , or 6, 

the cDNA of ATCC , or the cDNA of ATCC , or the 

complement thereof, corresponds to a naturally-occurring 
nucleic acid molecule. As used herein, a "naturally- 
occurring" nucleic acid molecule refers to an RNA or DNA 

20 molecule having a nucleotide sequence that occurs in 
nature (e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants 
of a nucleic acid molecule of the invention sequence that 
may exist in the population, the skilled artisan will 

25 further appreciate that changes can be introduced by 
mutation thereby leading to changes in the amino acid 
sequence of the encoded protein, without altering the 
biological activity of the protein. For example, one can 
make nucleotide substitutions leading to amino acid 

30 substitutions at "non-essential" amino acid residues. A 
"non-essential " amino acid residue is a residue that can 
be altered from the wild-type sequence without altering 
the biological activity, whereas an "essential" amino 
acid residue is required for biological activity. For 

3 5 example, amino acid residues that are not conserved or 
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of different individuals. This can be readily carried 
out by using hybridization probes to identify the same 
genetic locus in a variety of individuals. Any and all 
such nucleotide Variations and resulting amino acid 
5 polymorphisms or variations that are the result of 
natural allelic variation and that do not alter the 
functional activity are intended to be within the scope 
of the invention. 

Moreover, nucleic acid molecules encoding proteins 

10 of- the invention from other species (homologues) , which 
have a nucleotide sequence which differs from that of the 
protein described herein are intended to be within the 
scope of the invention. Nucleic acid molecules 
corresponding to natural allelic variants and homologues 

15 of a cDNA of the invention can be isolated based on their 
identity to the nucleic acid molecule disclosed herein 
using a cDNA, or a portion thereof, as a hybridization 
probe according to standard hybridization techniques 
under stringent hybridization conditions. For example, a 

2 0 cDNA encoding a soluble form of a membrane -bound protein 

of the invention isolated based on its hybridization to a 
nucleic acid molecule encoding all or part of the 
membrane -bound form. Likewise, a cDNA encoding a 
membrane -bound form can be isolated based on its 

25 hybridization to a nucleic acid molecule encoding all or 
part of the soluble form. 

Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 

30 800, 900, 1000, or 1290) nucleotides in length and 

hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence, preferably 
the coding sequence, of SEQ ID NO: 1, "3, 4, or 6 , the cDNA 
of ATCC , the cDNA of ATCC , or a complement 

3 5 thereof. 
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acids with basic side chains (e.g., lysine, arginine, 
histidine) , acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., 
glycine, asparagine, glutamine, serine, threonine, 
5 tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine) . 

10 Alternatively, mutations can be introduced randomly along 
all or part of the coding sequence, such as by 
saturation mutagenesis, and the resultant mutants can be 
screened for biological activity to identify mutants that: 
retain activity. Following mutagenesis, the encoded 

15 protein can be expressed recombinantly and the activity 
of the protein can be determined. 

In a preferred embodiment, a mutant polypeptide that 
is a variant of a polypeptide of the invention can be 
assayed for: (1) the ability to form protein: protein 

20 interactions with proteins in a signalling pathway of the 
polypeptide of the invention; (2) the ability to bind a 
ligand of the polypeptide of the invention; or (3) the 
ability to bind to an intracellular target protein of the 
polypeptide of the invention. In yet another preferred 

25 embodiment, the mutant polypeptide can be assayed for the 
ability to modulate cellular proliferation or cellular 
differentiation. 

The present invention encompasses antisense nucleic 
acid molecules, i.e., molecules which are complementary 

30 to a sense nucleic acid encoding a polypeptide of the 

invention, e.g., complementary to the coding strand of a 
double -stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 

35 nucleic acid can be complementary to an entire coding 
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only semi -conserved among homologues of various species 
may be non-essential for activity and thus would be 
likely targets for alteration. Alternatively, amino acid 
residues that are conserved among the homologues of 
5 various species (e.g., murine and human) may be essential 
for activity and thus would not be likely targets for 
alteration. 

Accordingly, another aspect of the invention 
pertains to nucleic acid molecules encoding a polypeptide 

10 of the invention that contain changes in amino acid 
residues that are not essential for activity. Such 
polypeptides differ in amino acid sequence from.SEQ ID 
NO: 2, 5, 8, and 13 yet retain biological activity. In 
one embodiment, the isolated nucleic acid molecule 

15 includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 
45% identical, 65%, 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of any of SEQ ID N0:2, 5, 8, or 
13. 

2 0 An isolated nucleic acid molecule encoding a variant 
protein can be created by introducing one or more 
nucleotide substitutions, additions or deletions into the 
nucleotide sequence of SEQ ID NO:l, 3, 4, or 6, the cDNA 
of ATCC , or the cDNA of ATCC such that one 

2 5 or more amino acid substitutions, additions or deletions 

are introduced into the encoded protein. Mutations can 
be introduced by standard techniques, such as site- 
directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are 

3 0 made at one or more predicted non-essential amino acid 

residues. A "conservative amino acid substitution" is 
one z.n which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. Families 
of amino acid residues having similar side chains have 
35 been defined in the art. These families include amino 
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methoxycarboxymethyluracil , 5-methoxyuracil , 2- 
methylthio -N6 - isopentenylacienine , uracil - 5 -oxyacet ic acid 
(v) , wybutoxosine , pseudouracil , queosine, 2- 
thiocytosine , 5 -methyl -2 - thiouracil , 2 -thiouracil , 4 - 
5 thiouracil, 5-methyluracil , uracil-5-oxyacetic acid 
methylester, uracil -5-oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- ( 3 -amino- 3 -N-2 -carboxypropyl ) uracil, 
(acp3)w, and 2 , 6 -diaminopurine . Alternatively, the 
antisense nucleic acid can be produced biologically using 

10 an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of 
interest, described further in the following subsection) . 

15 The antisense nucleic acid molecules of the 

invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 
to cellular mRNA and/or genomic DNA encoding a selected 
polypeptide of the invention to thereby inhibit 

20 expression, e.g., by inhibiting transcription and/or 
translation. The hybridization can be by conventional 
nucleotide complementarity to form a stable duplex, or, 
for example, in the case of an antisense nucleic acid 
molecule which binds to DNA duplexes, through specific 

2 5 interactions in the major groove of the double helix. An 

example of a route of administration of antisense nucleic 
acid molecules of the invention includes direct injection 
ac a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and 

3 0 then administered systemically . For example, for 

systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides 
3 5 or antibodies which bind to cell surface receptors or 
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strand, or to only a portion thereof, e.g., all or part 
of the protein coding region (or open reading frame) . An 
antisense nucleic acid molecule can be antisense to all 
or part of a noncoding region of the coding strand of a 
5 nucleotide sequence encoding a polypeptide of the 
invention. The noncoding regions ( " 5' and 3' 
untranslated regions") are the 5' and 3' sequences which 
flank the coding region and are not translated into amino 
acids . 

10 An antisense oligonucleotide can be, for example, 

about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides 
in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 

15 For example, an antisense nucleic acid (e.g., an 

antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 

20 stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 
Examples of modified nucleotides which can be used to 
generate the antisense nucleic acid include 5- 

25 f luorouracil , 5-bromouracil , 5-chlorouracil , 5- 

iodouracil, hypoxanthine , xanthine, 4 -acetylcytosine , 5- 
(carboxyhydroxyl methyl) uracil , 5 - 
carboxyme thylaminome thyl - 2 - thiour idine , 5 - 
carboxymethylaminomethyluracil , dihydrouracil , beta-D- 

3 0 galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine , 1 -methyl inosine , 2 , 2 -dimethylguanine , 2 - 
methyladenine , 2 -methylguanine , 3 -methylcytosine , 5 - 
methylcytosine , N6 -adenine, 7 -methylguanine, 5*- 
me t hy 1 ami nomethyluracil, 5-met hoxy am i nome t hy 1 - 2 - 

35 thiouracil, beta-D-mannosylqueos.ine, 5'- 
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Alternatively, an mRNA encoding a polypeptide of the 
invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA 
molecules. See, e.g., Bartel and Szostak (1993) Science 
5 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
example, expression of a polypeptide of the invention can 
be inhibited by targeting nucleotide sequences 

10 complementary to the regulatory region of the gene 
encoding the polypeptide (e.g., the promoter and/or 
enhancer) to form triple helical structures that prevent 
transcription of the gene in target cells. See generally 
Helene (1991) Anticancer Drug Des. 6(6) : 569-84; Helene 

15 (1992) Ann. N.Y. Acad. Sci . 660:27-36; and Maher (1992) 
Bioassays 14 (12) : 807-15 . 

In preferred embodiments, the nucleic acid molecules 
of the invention can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g., the 

20 stability, hybridization, or solubility of the molecule. 
For example, the deoxyribose phosphate backbone of the 
nucleic acids can be modified to generate peptide nucleic 
acids (see Hyrup et al . (1996) Bioorganic & Medicinal 
Chemistry 4(1) : 5-23) . As used herein, the terms 

25 "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 

30 allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al . 
(1996), supra; Perry-O' Keef e et al . (1996) Proc. Natl. 

35 Acad. Sci. C7SA 93: 14670-675. 


WO 00/18800 


PCT/US99/22818 


antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the 
5 antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 
preferred. 

An antisense nucleic acid molecule of the invention 
can be an a-anomeric nucleic acid molecule. An a- 

10 anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, 
contrary to the usual 0-units, the strands run parallel 
to each other (Gaultier et al. (1987) Nucleic Acids Res. 
15:6625-6641) . The antisense nucleic acid molecule can 

15 also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
RNA-DNA analogue (Inoue et al . (1987) FEBS Lett. 215:327- 
330) . 

The invention also encompasses ribozymes. Ribozymes 

20 are catalytic RNA molecules with ribonuclease activity 
which are capable of cleaving a single -stranded nucleic 
acid, such as an mRNA, to which they have a complementary 
region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 

25 334:585-591)) can be used to catalytically cleave mRNA 

transcripts to thereby inhibit translation of the protein 
encoded by the mRNA. A ribozyme having specificity for a 
nucleic acid molecule encoding a polypeptide of the 
invention can be designed based upon the nucleotide 

3 0 sequence of a cDNA disclosed herein. For example, a 
derivative of a Tetrahymena L-19 IVS RNA can be 
constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence 
to be cleaved in a Cech et al . U.S. Patent No. 4,987,071; 

35 and Cech et al . U.S. Patent No. 5,116,742. 
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PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a 
3' DNA segment (Finn et al . (1996) Nucleic Acids Res. 
24 (17) :3357-63) . Alternatively, chimeric molecules can 
5 be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al . (1975) Bioorganic Med. Chem. Lett. 
5:1119-11124) . 

In other embodiments, the oligonucleotide may 
include other appended groups such as peptides (e.g., for 

10 targeting host cell receptors in vivo) , or agents 

facilitating transport across the cell membrane {see, 
e.g., Letsinger et al . (1989) Proc. Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al , (1987) Proc. Natl. Acad. 
Sci. USA 84:648-652; PCT Publication No. W0 88/09810) or 

15 the blood-brain barrier (see, e.g., PCT Publication No. 
W0 89/10134) . In addition, oligonucleotides can be 
modified with hybridization- triggered cleavage agents 
(see, e.g., Krol et al . (1988) Bio/Techniques 6:958-976) 
or intercalating agents {see, e.g., Zon (1988) Pharm. 

2 0 Res. 5:53 9-549) . To this end, the oligonucleotide may be 

conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport 
agent, hybridization- triggered cleavage agent, etc. 

II. Isolated Proteins and Antibodies 
25 One aspect of the invention pertains to isolated 

proteins and polypeptides of the invention, and 
biologically active portions thereof, as well as 
polypeptide fragments suitable for use as immunogens to 
raise antibodies directed against a polypeptide of the 

3 0 invention. In one embodiment, the native polypeptide can 

be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein 
purification techniques. In another embodiment, 
polypeptides of the invention are produced by recombinant 
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PNAs can be used in therapeutic and diagnostic 
applications. For example, PNAs can be used as antisense 
or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or 
5 translation arrest or inhibiting replication. PNAs can 
also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PCR clamping; 
as artificial restriction enzymes when used in 
combination with other enzymes, e.g., SI nucleases (Hyrup 

10 (1996), supra; or as probes or primers for DNA sequence 
and hybridization (Hyrup (1996), supra; Perry-O' Keef e et 
al. (1996) Proc. Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs can be modified, e.g., 
to enhance their stability or cellular uptake, by 

15 attaching lipophilic or other helper groups to PNA, by 
the formation of PNA- DNA chimeras, or by the use of 
liposomes or other techniques of drug delivery known in 
the art. For example, PNA -DNA chimeras can be generated 
which may combine the advantageous properties of PNA and 

20 DNA. Such chimeras allow DNA recognition enzymes, e.g., 
RNAse H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding 
affinity and specificity. PNA -DNA chimeras can be linked 
using linkers of appropriate lengths selected in terms of 

25 base stacking, number of bonds between the nucleobases, 
and orientation (Hyrup (1996) , supra) . The synthesis of 
PNA -DNA chimeras can be performed as described in Hyrup 
(1996), supra, and Finn et al . (1996) Nucleic Acids Res. 
24 (17) : 3357-63 . For example, a DNA chain can be 

30 synthesized on a solid support using standard 
phosphor ami dite coupling chemistry and modified 
nucleoside analogs. Compounds such as 5'-(4- 
methoxycricyl) amino- 5' -deoxy- thymidine phosphoramidite 
can be used as a link between the PNA and the 5' end of 

35 DNA (Mag et al . (1989) Nucleic Acids Res. 17:5973-88). 
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fewer amino acids than the full length protein, and 
exhibit at least one activity of the corresponding full- 
length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of 
5 the corresponding protein. A biologically active portion 
of a protein of the invention can be a polypeptide which 
is, for example, 10, 25, 50, 100 or more amino acids in 
length. Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 

10 prepared by recombinant techniques and evaluated for one 
or more of the functional activities of the native form 
of a polypeptide of the invention. 

Preferred polypeptides have the amino acid sequence 
of SEQ ID NO:2, 5, 7-11, and 12-16. Other useful 

15 proteins are substantially identical (e.g., at least 

about 45%, preferably 55%, 65%, 75%, 85%, 95%, or 99%) to 
SEQ ID NO: 2, 5, 7-11, and 12-16 and retain the functional 
activity of the protein of the corresponding naturally- 
occurring protein yet differ in amino acid sequence due 

20 to natural allelic variation or mutagenesis. 

To determine the percent identity of two amino acid 
sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 

25 nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 

30 residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical at 
that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # 

35 of identical positions/total # of positions (e.g., 
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DNA techniques. Alternative to recombinant expression, a 
polypeptide of the invention can be synthesized 
chemically using standard peptide synthesis techniques. 
An "isolated" or "purified" protein or biologically 
5 active portion thereof is substantially free of cellular 
material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or 
substantially free of chemical precursors or other 
chemicals when chemically synthesized. The language 

10 "substantially free of cellular material" includes 

preparations of protein in which the protein is separated 
from cellular components of the cells from which it is 
isolated or recombinant ly produced. Thus, protein that 
is substantially free of cellular material includes 

15 preparations of protein having less than about 30%, 20%, 
10%, or 5% (by dry weight) of heterologous protein (also 
referred to herein as a "contaminating protein") . When 
the procein or biologically active portion thereof is 
recombinantly produced, it is also preferably 

20 substantially free of culture medium, i.e., culture 

medium represents less than about 20%, 10%, or 5% of the 
volume of the protein preparation. When the protein is 
produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 

25 chemicals, i.e., it is separated from chemical precursors 
or other chemicals which are involved in the synthesis of 
the procein. Accordingly such preparations of the 
protein have less than about 30%, 20%, 10%, 5% (by dry 
weight) of chemical precursors or compounds other than 

3 0 the polypeptide of interest. 

Biologically active portions of a polypeptide of the 
invention include polypeptides comprising amino acid 
sequences sufficiently identical to or derived from the 
amino acid sequence of the protein (e.g., the amino acid 

3 5 sequence of SEQ ID NO: 2, 5, 8, or 13) , which include 
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table, a gap length penalty of 12, and a gap penalty of 4 
can be used. 

The percent identity between two sequences, can be 
determined using techniques similar to those described 
5 above, with or without allowing gaps. In calculating 
percent identity, only exact matches are counted. 

The invention also provides chimeric or fusion 
proteins. As used herein, a "chimeric protein' 1 or 
"fusion protein" comprises all or part (preferably 

10 biologically active) of a polypeptide of the invention 
operably linked to a heterologous polypeptide (i.e., a 
polypeptide other than the same polypeptide of the 
invention) . Within the fusion protein, the term 
"operably linked" is intended to indicate that the 

15 polypeptide of the invention and the heterologous 
polypeptide are fused in- frame to each other. The 
heterologous polypeptide can be fused to the N- terminus 
or C-terminus of the polypeptide of the invention. 

One useful fusion protein is a GST fusion protein in 

20 which the polypeptide of the invention is fused to the C- 
terminus of GST sequences. Such fusion proteins can 
facilitate the purification of a recombinant polypeptide 
of the invention. 

In another embodiment, the fusion protein contains a 

25 heterologous signal sequence at its N-terminus. For 
example, the native signal sequence of a polypeptide of 
the invention can be removed and replaced with a signal 
sequence from another protein. For example, the gp67 
secretory sequence of the baculovirus envelope protein 

30 can be used as a heterologous signal sequence (Current 
Protocols in Molecular Biology, Ausubel et al . , eds . , 
John Wiley & Sons, 1992) . Other examples of eukaryotic 
heterologous signal sequences include the secretory 
sequences of melittin and human placental alkaline 

35 phosphatase (Stratagene; La Jolla, California) . In yet 
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overlapping positions) x 100) . Preferably, the two 
sequences are the same length. 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
5 algorithm. A preferred, non- limiting example of a 

mathematical algorithm utilized for the comparison of two 
sequences is the algorithm of Karlin and Altschul (1990) 
Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 

10 90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) J. 
Mol. Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 

15 to a nucleic acid molecules of the invention. BLAST 
protein searches can be performed with the XBLAST 
program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to a protein molecules of the 
invention. To obtain gapped alignments for comparison 

20 purposes, Gapped BLAST can be utilized as described in 
Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. 
Alternatively, PSI -Blast can be used to perform an 
iterated search which detects distant relationships 
between molecules. Id. When utilizing BLAST, Gapped 

2 5 BLAST, and PSI -Blast programs, the' default parameters of 
the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. Another 
preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of sequences is the 

30 algorithm of Myers and Miller, (1.988) CABIOS 4:11-17. 

Such an algorithm is incorporated into the ALIGN program 
(version 2.0) which is part of the GCG sequence alignment 
software package. When utilizing the ALIGN program for 
comparing amino acid sequences, a PAM120 weight residue 


WO 00/18800 


PCT/US99/22818 


- 40 - 

sequence (see, e.g., Ausubel et al., supra). Moreover, 
many expression vectors are commercially available that 
already encode a fusion moiety (e.g., a GST polypeptide) 
A nucleic acid encoding a polypeptide of the invention 
5 can be cloned into such an expression vector such that 
the fusion moiety is linked in-frame to the polypeptide 
of the invention. 

A signal sequence of a polypeptide of the invention 
(SEQ ID NO: 7 or 12) can be used to facilitate secretion 

10 and isolation of a secreted protein or other protein of 
interest. Signal sequences are typically characterized 
by a core of hydrophobic amino acids which are generally 
cleaved from the mature protein during secretion in one 
or more cleavage events. Such signal peptides contain 

15 processing sites that allow cleavage of the signal 

sequence from the mature proteins as they pass through 
the secretory pathway. Thus, the invention pertains to 
the described polypeptides having a signal sequence, as 
well as to the signal sequence itself and to the 

20 polypeptide in the absence of the signal sequence (i.e., 
the cleavage products) . In one embodiment, a nucleic 
acid sequence encoding a signal sequence of the invention 
can be operably linked in an expression vector to a 
protein of interest, such as a protein which is 

25 ordinarily not secreted or is otherwise difficult to 
isolate. The signal sequence directs secretion of the 
protein, such as from a eukaryotic host into which the 
expression vector is transformed, and the signal sequence 
is subsequently or concurrently cleaved. The protein can 

30 then be readily purified from the extracellular medium by 
art recognized methods. Alternatively, the signal 
sequence can be linked to the protein of interest using a 
sequence which facilitates purification, such as with a 
GST domain. 
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another example, useful prokaryotic heterologous signal 
sequences include the phoA secretory signal (Sambrook et 
al . , supra) and the protein A secretory signal (Pharmacia 
Biotech; Piscataway, New Jersey) . 
5 In yet another embodiment, the fusion protein is an 

immunoglobulin fusion protein in which all or part of a 
polypeptide of the invention is fused to sequences 
derived from a member of the immunoglobulin protein 
family. The immunoglobulin fusion proteins of the 

10 invention can be incorporated into pharmaceutical 

compositions and administered to a subject to inhibit an 
interaction between a ligand (soluble or membrane -bound) 
and a protein on the surface of a cell (receptor) , to 
thereby suppress signal transduction in vivo. The 

15 immunoglobulin fusion protein can be used to affect the 
bioavailability of a cognate ligand of a polypeptide of 
the invention. Inhibition of ligand/receptor interaction 
may be useful therapeutically, both for treating 
proliferative and dif f erentiative disorders and for 

20 modulating (e.g. promoting or inhibiting) cell survival. 
Moreover, the immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce antibodies 
directed against a polypeptide of the invention in a 
subject, to purify ligands and in screening assays to 

25 identify molecules which inhibit the interaction of 
receptors with ligands. 

Chimeric and fusion protein of the invention can be 
produced by standard recombinant DNA techniques. In 
another embodiment, the fusion gene can be synthesized by 

3 0 conventional techniques including automated DNA 

synthesizers. Alternatively, PCR amplification of gene 
fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be 

3 5 annealed and reamplified to generate a chimeric gene 


WO 00/18800 


PCT/US99/22818 


- 42 - 

invention for agonist or antagonist activity. In one 
embodiment, a variegated library of variants is generated 
by combinatorial mutagenesis at the nucleic acid level 
and is encoded by a variegated gene library. A 
5 variegated library of variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a 
degenerate set of potential protein sequences is 
expressible as individual polypeptides, or alternatively, 

10 as a set of larger fusion proteins (e.g., for phage 
display) . There are a variety of methods which can be 
used to produce libraries of potential variants of the 
polypeptides of the invention from a degenerate 
oligonucleotide sequence. Methods for synthesizing 

15 degenerate oligonucleotides are known in the art (see, 
e.g., Narang (1983) Tetrahedron 39:3; Itakura et al . 
(1984) Armu. Rev. Biochem. 53:323; Itakura et al . (1984) 
Science 198:1056; Ike et al . (1983) Nucleic Acid Res. 
11:477) . 

20 In addition, libraries of fragments of the coding 

sequence of a polypeptide of the invention can be used to 

generate a variegated population of polypeptides for 

screening and subsequent selection of variants. For 

example, a library of coding sequence fragments can be 

o 

25 generated by treating a double stranded PCR fragment of 
the coding sequence of interest with a nuclease under 
conditions wherein nicking occurs only about once per 
molecule, denaturing the double stranded DNA, renaturing 
the DNA to form double stranded DNA which can include 

3 0 sense/antisense pairs from different nicked products, 

removing single stranded portions from reformed duplexes 
by treatment with SI nuclease, and ligating the resulting 
fragment: library into an expression vector. By this 
method, an expression library can be derived which 


WO 00/18800 


PCTYUS99/22818 


- 41 - 

In another embodiment, the signal sequences of the 
present invention can be used to identify regulatory 
sequences, e.g., promoters, enhancers, repressors. Since 
signal sequences are the most amino- terminal sequences of 
5 a peptide, it is expected that the nucleic acids which 
flank the signal sequence on its . amino- terminal side will 
be regulatory sequences which affect transcription. 
Thus, a nucleotide sequence which encodes all or a 
portion of a signal sequence can be used as a probe to 

10 identify and isolate signal sequences and their flanking 
regions, and these flanking regions can be studied to 
identify regulatory elements therein. 

The present invention also pertains to variants of the 
polypeptides of the invention. Such variants have an 

15 altered amino acid sequence which can function as either 
agonists (mimetics) or as antagonists. Variants can be 
generated by mutagenesis, e.g., discrete point mutation 
or truncation. An agonist can retain substantially the 
same, or a subset, of the biological activities of the 

20 naturally occurring form of the protein. An antagonist 
of a protein can inhibit one or more of the activities of 
the naturally occurring form of the protein by, for 
example, competitively binding to a downstream or 
upstream member of a cellular signaling cascade which 

25 includes the protein of interest. Thus, specific 

biological effects can be elicited by treatment with a 
variant of limited function. Treatyent of a subject with 
a variant having a subset of the biological activities of 
the naturally occurring form of the protein can have 

3 0 fewer side effects in a subject relative to treatment 
with the naturally occurring form of the protein. 

Variants of a protein of the invention which function 
as either agonists (mimetics) or as antagonists can be 
identified by screening combinatorial libraries of 

35 mutants, e.g., truncation mutants, of the protein of the 
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protein, e.g., hydrophilic regions. Figures 2 and 4 are 
hydrophobic ity plots of the proteins of the invention. 
These plots or similar analyses can be used to identify 
hydrophilic regions. 
5 An immunogen typically is used to prepare antibodies by 
immunizing a suitable subject, (e.g., rabbit, goat, mouse 
or other mammal) . An appropriate immunogenic preparation 
can contain, for example, recombinantly expressed 
chemically synthesized polypeptide. The preparation can 

10 further include an adjuvant, such as Freund's complete or 
incomplete adjuvant, or similar imrnunostimulatory agent. 

Accordingly, another aspect of the invention pertains 
to antibodies directed against a polypeptide of the 
invention. The term "antibody" as used herein refers to 

15 immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules, i.e., molecules 
that contain an antigen binding site which specifically 
binds an antigen, such as a polypeptide of the invention. 
A molecule which specifically binds to a given 

2 0 polypeptide of the invention is a molecule which binds 

the polypeptide, but does not substantially bind other 
molecules in a sample, e.g., a biological sample, which 
naturally contains the polypeptide. Examples of 
immunologically active portions of immunoglobulin 

25 molecules include F(ab) and F(ab') 2 fragments which can be 
generated by treating the antibody with an enzyme such as 
pepsin. The invention provides polyclonal and monoclonal 
antibodies. The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, refers 

30 to a population of antibody molecules that contain only 
one species of an antigen binding site capable of 
immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described 
above by immunizing a suitable subject with a polypeptide 

3 5 of the invention as an immunogen. The antibody titer in 
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encodes N- terminal and internal fragments of various 
sizes of the protein of interest. 

Several techniques are known in the art for screening 
gene products of combinatorial libraries made by point 
5 mutations or truncation, and for screening cDNA libraries 
for gene products having a selected property. The most 
widely used techniques, which are amenable to high 
through-put analysis, for screening large gene libraries 
typically include cloning the gene library into 

10 replicable expression vectors, transforming appropriate 
cells with the resulting library of vectors, and 
expressing the combinatorial genes under conditions in 
which detection of a desired activity facilitates 
isolation of the vector encoding the gene whose product 

15 was detected. Recursive ensemble mutagenesis (REM) , a 
technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with 
the screening assays to identify variants of a protein of 
the invention (Arkin and Yourvan (1992) Proc. Natl. Acad. 

20 Sci. USA 89:7811-7815; Delgrave et al . (1993) Protein 
Engineering 6 (3 ): 327-331) . 

An isolated polypeptide of the invention, or a fragment 
thereof, can be used as an immunogen to generate 
antibodies using standard techniques for polyclonal and 

25 monoclonal antibody preparation. The full-length 

polypeptide or protein can be used or, alternatively, the 
invention provides antigenic peptide fragments for use as 
immunogens . The antigenic peptide of a protein of the 
invention comprises at least 8 (preferably 10, 15, 20, or 

30 30) amino acid residues of the amino acid sequence of SEQ 
ID NO: 8 or 13 and encompasses an epitope of the protein 
such that an antibody raised against the peptide forms a 
specific immune complex with the protein. 

Preferred epitopes encompassed by the antigenic peptide 

35 are regions that are located on the surface of the 
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particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 
U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
5 Publication No. WO 92/20791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 
92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum. 
10 Antibod. Hybridomas 3:81-85; Huse et al . (1989) Science 
246:1275-1281; Griffiths et al . (1993) EMBO J. 12:725- 
734 . 

Additionally, recombinant antibodies, such as chimeric 
and humanized monoclonal antibodies, comprising both 

15 human and non-human portions, which can be made using 

standard recombinant DNA techniques , are within the scope 
of the invention. Such chimeric and humanized monoclonal 
antibodies can be produced by recombinant DNA techniques 
known in the art, for example using methods described in 

20 PCT Publication No. WO 87/02671; European Patent 

Application 184,187; European Patent Application 171,496; 
European Patent Application 173,494; PCT Publication No. 
WO 86/01533; U.S. Patent No. 4,816,567; European Patent 
Application 125,023; Better et al . (1988) Science 

25 240:1041-1043; Liu et al . (1987) Proc . Natl. Acad. Sci . 
USA 84:3439-3443; Liu et al . (1987) J. Jmrnunol . 
139:3521-3526; Sun et al . (1987) Proc. Natl. Acad. Sci. 
USA 84:214-218; Nishimura et al . (1987) Cane. Res. 
47:999-1005; Wood et al. (1985) Nature 314:446-449; and 

30 Shaw et al . (1988) J. Natl. Cancer Inst. 80:1553-1559); 
Morrison (1985) Science 229:1202-1207; Oi et al . (1986) 
Bio/Techniques 4:214; U.S. Patent 5,225,539; Jones et al . 
(1986) Nature 321:552-525; Verhoeyan et al . (1988) 
Science 23 9:1534; and Beidler et al . (1988) J. Immunol. 

35 141:4053-4060. 


WO 00/18800 


PCT/US99/22818 


- 45 - 

the immunized subject can be monitored over time by- 
standard techniques, such as with an enzyme linked 
immunosorbent assay (ELISA) using immobilized 
polypeptide. If desired, the antibody molecules can be 
5 isolated from the mammal (e.g., from the blood) and 
further purified by well-known techniques, such as 
protein A chromatography to obtain the IgG fraction. At 
an appropriate time after immunization, e.g., when the 
specific antibody titers are highest, antibody -producing 

10 cells can be obtained from the subject and used to 
prepare monoclonal antibodies by standard techniques, 
such as the hybridoma technique originally described by 
Kohler and Milstein (1975) Nature 256:495-497, the human 
B cell hybridoma technique (Kozbor et al. (1983) Immunol. 

15 Today 4:72), the EBV-hybridoma technique (Cole et al . 

(1985), Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc., pp. 77-96) or trioma techniques. The 
technology for producing hybridomas is well known (see 
generally Current Protocols in Immunology (1994) Coligan 

20 et al. (eds.) John Wiley & Sons, Inc., New York, NY). 
Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of 
interest, e.g., using a standard ELISA assay. 

25 Alternative to preparing monoclonal antibody-secreting 
hybridomas, a monoclonal antibody directed against a 
polypeptide of the invention can be identified and 
isolated by screening a recombinant combinatorial 
immunoglobulin library (e.g., an antibody phage display 

30 library) with the polypeptide of interest. Kits for 
generating and screening phage display libraries are 
commercially available (e.g., the Pharmacia Recombinant 
Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene SurfZAP™ Phage Display Kit, Catalog No. 

35 240612) . Additionally, examples of methods and reagents 
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An antibody directed against a polypeptide of the 
invention (e.g., monoclonal antibody) can be used to 
isolate the polypeptide by standard techniques, such as 
affinity chromatography or immunoprecipitation. 
5 Moreover, such an antibody can be used to detect the 
protein (e.g., in a cellular lysate or cell supernatant) 
in order to evaluate the abundance and pattern of 
expression of the polypeptide. The antibodies can also 
be used diagnostically to monitor protein levels in 

10 tissue as part of a clinical testing procedure, e.g., to, 
for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic 

15 groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. 
Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, /?-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic 

20 group complexes include streptavidin/biotin and 

avidin/biotin; examples of suitable fluorescent materials 
include umbellif erone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example 

25 of a luminescent material includes luminol; examples of 
bioluminescent materials include lucif erase, luciferin, 
and aequorin, and examples of suitable radioactive 
material include 125 I, 131 I, 35 S or 3 H. 

III. Recombinant Expression Vectors and Host Cells 
3 0 Another aspect of the invention pertains to vectors, 
preferably expression vectors, containing a nucleic acid 
encoding a polypeptide of the invention (or a portion 
thereof) . As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 
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Completely human antibodies are particularly desirable 
for therapeutic treatment of human patients. Such 
antibodies can be produced using transgenic mice which 
are incapable of expressing endogenous immunoglobulin 
5 heavy and light chains genes, but which can express human 
heavy and light chain genes. The transgenic mice are 
immunized in the normal fashion with a selected antigen, 
e.g., all or a portion of a polypeptide of the invention. 
Monoclonal antibodies directed against the antigen can be 

10 obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the 
transgenic mice rearrange during B cell differentiation, 
and subsequently undergo class switching and somatic 
mutation. Thus, using such a technique, it is possible 

15 to produce therapeutically useful IgG, IgA and IgE 
antibodies . For an overview of this technology for 
producing human antibodies, see Lonberg and Huszar (1995, 
Int. -Rev. Immunol. 13:65-93). For a detailed discussion 
of this technology for producing human antibodies and 

20 human monoclonal antibodies and protocols for producing 
such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 
5,661,016; and U.S. Patent 5,545,806. In addition, 
companies such as Abgenix, Inc. (Freemont, CA) # can be 

25 engaged to provide human antibodies directed against a 
selected antigen using technology similar to that 
described above. 

Completely human antibodies which recognize a selected 
epitope can be generated using a technique referred to as 

30 "guided selection." In this approach a selected 

non-human monoclonal antibody, e.g., a murine antibody, 
is used to guide the selection of a completely, human 
antibody recognizing the same epitope (Jespers et al . 
(1994) Bio/Technology 12:899-903). 
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the vector is introduced into the host cell) . The term 
"regulatory sequence" is intended to include promoters, 
enhancers and other expression control elements (e.g., 
polyadenylation signals) . Such regulatory sequences are 
5 described, for example, in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, 
San Diego, CA (1990) . Regulatory sequences include those 
which direct constitutive expression of a nucleotide 
sequence in many types of host cell and those which 

10 direct expression of the nucleotide sequence only in 
certain host cells (e.g., tissue-specific regulatory 
sequences) . It will be appreciated by those skilled in 
the art that the design of the expression vector can 
depend on such factors as the choice of the host cell to 

15 be transformed, the level of expression of protein 

desired, etc. The expression vectors of the invention 
can be introduced into host cells to thereby produce 
proteins, or peptides, including fusion proteins or 
peptides, encoded by nucleic acids as described herein. 

20 The recombinant expression vectors of the invention can 
be designed for expression of a polypeptide of the 
invention in prokaryotic or eukaryotic cells, e.g., 
bacterial cells such as E, coli, insect cells (using 
baculovirus expression vectors) , yeast cells or mammalian 

25 cells. Suitable host cells are discussed further in 
Goeddel, supra. Alternatively, the recombinant 
expression vector can be transcribed and translated in 
vitro, tor example using T7 promoter regulatory sequences 
and T7 polymerase. 

3 0 Expression of proteins in prokaryotes is most often 
carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non-fusion proteins. 
Fusion vectors add a number of amino acids to a protein 

3 5 encoded therein, usually to the amino terminus of the 
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nucleic acid to which it has been linked. One type of 
vector is a "plastnid" , which refers to a circular double 
stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, 
5 wherein additional DNA segments can be ligated into the 
viral genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors) . Other 

10 vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the 

15 expression of genes to which they are operably linked. 
In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
such other forms of expression vectors, such as viral 

20 vectors (e.g. , replication defective retroviruses, 

adenoviruses and adeno-associated viruses) , which serve 
equivalent functions . 

The recombinant expression vectors of the invention 
comprise a nucleic acid of the invention in a form 

25 suitable for expression of the nucleic acid in a host 

cell. This means that the recombinant expression vectors 
include one or more regulatory sequences, selected on the 
basis of the host cells to be used for expression, which 
is operably linked to the nucleic acid sequence to be 

3 0 expressed. Within a recombinant expression vector, 

"operably linked" is intended to mean that the nucleotide 
sequence of interest is linked to the regulatory 
sequence (s) in a manner which allows for expression of 
the nucleotide sequence (e.g., in an in vitro 

3 5 transcript ion/ translation system or in a host cell when 
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recombinant protein (Gottesman, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, 
San Diego, California (1990) 119-128) . Another strategy 
is to alter the nucleic acid sequence of the nucleic acid 
5 to be inserted into an expression vector so that the 
individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et al . (1992) 
Nucleic Acids Res, 20:2111-2118). Such alteration of 
nucleic acid sequences of the invention can be carried 

10 out by standard DNA synthesis techniques. 

In another embodiment, the expression vector is a yeast 
expression vector. Examples of vectors for expression in 
yeast S. cerivisae include pYepSecl (Baldari et al . 
(1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, 

15 (1982) Cell 30:933-943), pJRY88 (Schult z et al . (1987) 
Gene 54:113-123), pYES2 (Invitrogen Corporation, San 
Diego, CA) , and pPicZ (Invitrogen Corp, San Diego, CA) . 

Alternatively, the expression vector is a baculovirus 
expression vector. Baculovirus vectors available for 

20 expression of proteins in cultured insect cells (e.g., Sf 
9 cells) include the pAc series (Smith et al . (1983) Mol . 
Cell Biol. 3:2156-2165) and the pVL series (Lucklow and 
Summers (1989) Virology 170 : 31-39) . 

In yet another embodiment, a nucleic acid of the 

25 invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 
329:840) and pMT2PC (Kaufman et al . (1987) EMBO J. 6:187- 
195) . When used in mammalian cells, the expression 

30 vector's control functions are often provided by viral 
regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic 

35 cells see chapters 16 and 17 of Sambrook et al . , supra. 
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recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 
of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase . 
Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transferase (GST) , maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli 
expression vectors include pTrc (Amann et al., (1988) 
Gene 69:301-315) and pET lid (Studier et al . , Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 
host RNA polymerase transcription from a hybrid trp-lac 
fusion promoter. Target gene expression from the pET lid 
vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
strains BL21JDE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

One strategy tc maximize recombinant protein expression 
in E. coli is to express the protein in a host bacteria 
with an impaired capacity to proteolytically cleave the 
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cell types, for instance viral promoters and/or 
enhancers, or regulatory sequences can be chosen which 
direct constitutive, tissue specific or cell type 
specific expression of antisense RNA. The antisense 
5 expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense 
nucleic acids are produced under the control of a high 
efficiency regulatory region, the activity of which can 
be determined by the cell type into which the vector is 

10 introduced. For a discussion of the regulation of gene 
expression using antisense genes see Weintraub et al. 
(Reviews - Trends in Genetics, Vol. 1(1) 1986). 

Another aspect of the invention pertains to host cells 
into which a recombinant expression vector of the 

15 invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 

20 may occur in succeeding generations due to either 

mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 
still included within the scope of the term as used 
herein . 

25 A host cell can be any prokaryotic (e.g., E. coli) or 
eukaryotic cell (e.g., an insect cell, yeast, or a 
mammalian cell) . 

Vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 

30 transfection techniques. As used herein, the terms 

" transformation" and "transfection" are intended to refer 
to a variety of arc -recognized techniques for introducing 
foreign nucleic acid into a host cell, including calcium 
phosphace or calcium chloride co-precipitation, DEAE- 

35 dextran-mediated transfection, lipofection, or 


WO 00/18800 PCT/US99/22818 

- 53 - 

In another embodiment, the recombinant mammalian 
expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
5 express the nucleic acid) . Tissue-specific regulatory 
elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin 
promoter (liver-specific; Pinkert et al. (1987) Genes 
Dev. 1:268-277) , lymphoid- specif ic promoters (Calame and 

10 Eaton (1988) Adv. Immunol. 43:235-275), in particular 
promoters of T cell receptors (Winoto and Baltimore 
(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et 
al . (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron- specif ic promoters (e.g., the 

15 neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl. Acad. Sci . USA 86:5473-5477), pancreas-specific 
promoters (Edlund et al . (1985) Science 230:912-916), and 
mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European 

20 Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 
249:374-379) and the a- fetoprotein promoter (Campes and 
Tilghman (1989) Genes Dev. 3:537-546). 

25 The invention further provides a recombinant expression 
vector comprising a DNA molecule of the invention cloned 
into the expression vector in an antisense orientation. 
That is, the DNA molecule is operably linked to a 
regulatory sequence in a manner which allows for 

30 expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to the mRNA encoding a 
polypeptide of the invention. Regulatory sequences 
operably linked :o a nucleic acid cloned in the antisense 
orientation can be chosen which direct the continuous 

35 expression of the antisense RNA molecule in a variety of 
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non-human transgenic animals in which exogenous sequences 
encoding a polypeptide of the invention have been 
introduced into their genome or homologous recombinant 
animals in which endogenous encoding a polypeptide of the 
5 invention sequences have been altered. Such animals are 
useful for studying the function and/or activity of the 
polypeptide and for identifying and/or evaluating 
modulators of polypeptide activity. As used herein, a 
"transgenic animal" is a -non -human animal, preferably a 

10 mammal, more preferably a rodent such as a rat or mouse, 
in which one or more of the cells of the animal includes 
a transgene. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, 
chickens, amphibians, etc. A transgene is exogenous DNA 

15 which is integrated into the genome of a cell from which 
a transgenic animal develops and which remains in the 
genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell 
types or tissues of the transgenic animal. As used 

2 0 herein, an "homologous recombinant animal" is a non-human 
animal, preferably a mammal, more preferably a mouse, in 
which an endogenous gene has been altered by homologous 
recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the 

25 animal, e.g., an embryonic cell of the animal, prior to 
development of the animal . 

A transgenic animal of the invention can be created by 
introducing nucleic acid encoding a polypeptide of the 
invention (or a homologue thereof) into the male 

30 pronuclei of a fertilized oocyte, e.g., by 

microinjection, retroviral infection, and allowing the 
oocyte to develop in a pseudopregnant female foster 
animal. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the 

35 efficiency of expression of the transgene. A tissue- 
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electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al . 
{supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known 
5 that, depending upon the expression vector and 

transfection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome. 
In order to identify and select these integrants, a gene 
that encodes a selectable marker (e.g., for resistance to 

10 antibiotics) is generally introduced into the host cells 
along with the gene of interest. Preferred selectable 
markers include those which confer resistance to drugs, 
such as G418, hygromycin and methotrexate. Cells stably 
transfected with the introduced nucleic acid can be 

15 identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 
while the other cells die) . 

A host cell of the invention, such as a prokaryotic or 
eukaryotic host cell in culture, can be used to produce a 

20 polypeptide of the invention. Accordingly, the invention 
further provides methods for producing a polypeptide of 
the invention using the host cells of the invention. In 
one embodiment, the method comprises culturing the host 
cell of invention (into which a recombinant expression 

25 vector encoding a polypeptide of the invention has been 
introduced) in a suitable medium such that the 
polypeptide is produced. In another embodiment, the 
method further comprises isolating the polypeptide from 
the medium or the host cell. 

30 The host cells of the invention can also be used to 

produce nonhuman transgenic animals. For example, in one 
embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which a sequences 
encoding a polypeptide of the invention have been 

3 5 introduced. Such host cells can then be used to create 


WO 00/18800 


PCT7US99/22818 


- 58 - 

5' and 3' ends by additional nucleic acid of the gene to 
allow for homologous recombination to occur between the 
exogenous gene carried by the vector and an endogenous 
gene in an embryonic stem cell. The additional flanking 
nucleic acid sequences are of sufficient length for 
successful homologous recombination with the endogenous 
gene. Typically, several kilobases of flanking DNA (both 
at the 5' and 3' ends) are included in the vector (see, 
e.g., Thomas and Capecchi (1987) Cell 51:503 for a 
description of homologous recombination vectors) . The 
vector is introduced into an embryonic stem cell line 
(e.g., by electroporation) and cells in which the 
introduced gene has homologously recombined with the 
endogenous gene are selected {see, e.g., Li et al . (1992) 
Cell 69:915). The selected cells are then injected into 
a blastocyst of an animal (e.g., a mouse) to form 
aggregation chimeras (see, e.g., Bradley in 
Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 113- 
152) . A chimeric embryo can then be implanted into a 
suitable pseudopregnant female foster animal and the 
embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing 
homologous recombination vectors and homologous 
recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication NOS . WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can 
be produced which contain selected systems which allow 
for regulated expression of the transgene. One example 
of such a system is the cre/loxP recombinase system of 
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specific regulatory sequence (s) can be operably linked to 
the transgene to direct expression of the polypeptide of 
the invention to particular cells. Methods for 
generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have 
become conventional in the art and are described, for 
example, in U.S. Patent NOS . 4,736,866 and 4,870,009, 
U.S. Patent No. 4,873,191 and in Hogan, Manipulating the 
Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y., 1986) . Similar methods are used for 
production of othei j transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the transgene in its genome and/or expression of mRNA 
encoding the transgene in tissues or cells of the 
animals. A transgenic founder animal can then be used to 
breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying the transgene can 
further be bred to other transgenic animals carrying 
other transgenes. 

To create an homologous recombinant animal, a vector is 
prepared which contains at least a portion of a gene 
encoding a polypeptide of the invention into which a 
deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the gene. In 
a preferred embodiment, the vector is designed such that, 
upon homologous recombination, the endogenous gene is 
functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" 
vector) . Alternatively, the vector can be designed such 
that, upon homologous recombination, the endogenous gene 
is mutated or otherwise altered but still encodes 
functional protein (e.g., the upstream regulatory region 
can be altered to thereby alter the expression of the 
endogenous protein) . In the homologous recombination 
vector., the altered portion of the gene is flanked at its 
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Supplementary active compounds can also be incorporated 
into the compositions. 

A pharmaceutical composition of the invention is 
formulated to be compatible with its intended route of 
5 administration. Examples of routes of administration 
include parenteral, e.g., intravenous, intradermal, 
subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal, and rectal administration. 
Solutions or suspensions used for parenteral, 

10 intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 
glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 

15 methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid; buffers such as 
acetates, citrates or phosphates and agents for the 
adjustment of tonicity such as sodium chloride or 

20 dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable 
syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use 
25 include sterile aqueous solutions (where water soluble) 
or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or 
dispersions. For intravenous administration, suitable 
carriers include physiological saline, bacteriostatic 
30 water, Cremophor EL™ (BASF; Parsippany, NJ) or phosphate 
buffered saline (PBS) . In all cases, the composition 
must be sterile and should be fluid to the extent that 
easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be 
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bacteriophage PI. For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et al . (1992) Proc. 
Natl. Acad. Sci. USA 89:6232-6236. Another example of a 
recombinase system is the FLP recombinase system of 
5 Saccharomyces cerevisiae (O'Gorman et al . (1991) Science 
251:1351-1355. If a cre/loxP recombinase system is used 
to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase 
and a selected protein are required. Such animals can be 

10 provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one 
containing a transgene encoding a selected protein and 
the other containing a transgene encoding a recombinase. 
Clones of the non-human transgenic animals described 

15 herein can also be produced according to the methods 

described in Wilmut et al . (1997) Nature 385:810-813 and 
PCT Publication NOS . WO 97/07668 and WO 97/07669. 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and 

20 antibodies (also referred to herein as "active 

compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. 
Such compositions typically comprise the nucleic acid 
molecule, protein, or antibody and a pharmaceutically 

2 5 acceptable carrier. As used herein the language 

"pharmaceutically acceptable carrier" is intended to 
include any and all solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with 

30 pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well 
known in the art. Except insofar as any conventional 
media or agent is incompatible with the active compound, 
use thereof in the compositions is contemplated. 
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Oral compositions generally include an inert diluent or 
an edible carrier. They can be enclosed in gelatin 
capsules or compressed into tablets. For the purpose of 
oral therapeutic administration, the active ' compound can 
be incorporated with excipients and used in the form of 
tablets, troches, or capsules. Oral compositions can 
also be prepared using a fluid carrier for use as a 
mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 
Pharmaceutically compatible binding agents, and/or 
adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and 
the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
agent such as alginic acid, Primogel, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
flavoring. 

For administration by inhalation, the compounds are 
delivered in the form of an aerosol spray from a 
pressurized container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 

Systemic administration can also be by transmucosal or 
transdermal means. For tPransmucosal or transdermal 
administration, penetrants appropriate to the barrier to 
be permeated are used in the formulation. Such 
penetrants are generally known in the art, and include, 
for example, for rransmucosal administration, detergents, 
bile salts, and fusidic acid derivatives. Transmucosal 
administration can be accomplished through the use of 
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preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier 
can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyetheylene glycol, and 
the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the 
required particle size in the case of dispersion and by 
the use of surfactants. Prevention of the action of 
microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, 
chlorobutanoi , phenol, ascorbic acid, thimerosal, and the 
like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars 0 polyalcohols such 
as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the 
composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
incorporating the active compound (e.g., a polypeptide or 
antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients 
enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by 
incorporating the active compound into a sterile vehicle 
which contains a basic dispersion medium and the required 
other ingredients from those enumerated above. In the 
case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of 
preparation are vacuum drying and f reeze-drying which 
yields a powder of the active ingredient plus any 
additional des_red ingredient from a previously sterile- 
filtered solution thereof. 
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the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 
particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
5 active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 
100 mg/kg of body weight (generally 10 mg/kg to 20 
mg/kg) . If the antibody is to act in the brain, a dosage 
of 50 mg/kg to 100 mg/kg is usually appropriate. 

10 Generally, partially human antibodies and fully human 
antibodies have a longer half -life within the human body 
than other antibodies. Accordingly, lower dosages and 
less frequent administration is often possible. 
Modifications such as lipidation can be used to stabilize 

15 antibodies and to enhance uptake and tissue penetration 
(e.g., into the brain). A method for lipidation of 
antibodies is described by Cruikshank et al . ((1997) J*. 
Acquired Immune Deficiency Syndromes and Human 
Retrovirology 14:193) . 

2 0 The nucleic acid molecules of the invention can be 

inserted into vectors and used as gene therapy vectors. 
Gene therapy vectors can be delivered to a subject by, 
for example, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
25 (see, e.g., Chen et al . (1994) Proc. Natl. Acad. Sci. USA 
91:3054-3057). The pharmaceutical preparation of the 
gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 

3 0 Alternatively, where the complete gene delivery vector 

can be produced intact from recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can 
include one or more cells which produce the gene delivery 
system. 
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nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in 
the art . 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared 
with carriers that will protect the compound against 
rapid elimination from the body, such as a controlled 
release formulation, including implants and 
microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, polyorthoesters , and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
pharmaceutical^ acceptable carriers. These can be 
prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4 , 522, 811 . 

It is especially advantageous to formulate oral or 
parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 
compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
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A. Screening Assays 

The invention provides a method (also referred to 
herein as a "screening assay") for identifying 
modulators, i.e., candidate or test compounds or agents 
(e.g., peptides, peptidomimetics, small molecules or 
other drugs) which bind to polypeptide of the invention 
or have a stimulatory or inhibitory effect on, for 
example, expression or activity of a polypeptide of the 
invention. 

In one embodiment, the invention provides assays for 
screening candidate or test compounds which bind to or 
modulate the accivity of the membrane -bound form of a 
polypeptide of the invention or biologically active 
portion thereof. The test compounds of the present 
invention can be obtained using any of the numerous 
approaches in combinatorial library methods known in the 
art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase 
libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library 
method; and synthetic library methods using affinity 
chromatography selection. The biological library 
approach is limited to peptide libraries, while the other 
four approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam 
(1997) Anticancer Drug Des. 12:145) . 

Examples of methods for the synthesis of molecular 
libraries can be found in the art, for example in: 
DeWitt et al . (1993) Proc. Natl. Acad. Sci. USA 90:6909; 
Erb et al . (1994) Proc. Natl. Acad. Sci. USA 91:11422; 
Zuckermann et al . (1994). J. Med. Chem. 37:2678; Cho et 
al. (1993) Science 261:1303; Carrell et al . (1994) Angew. 
Chem. Int. Ed. Engl. 33:2059; Carell et al . (1994) Angew. 
Chem. Int. Ed. Engl. 33:2061; and Gallop et al . (1994) J. 
Med. Chem. 37:1233. 
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The pharmaceutical compositions can be included in a 
container, pack, or dispenser together with instructions 
for administration . 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 
assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) ; c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic). For 
example, polypeptides of the invention can to used to (i) 
modulate cellular proliferation; (ii) modulate cellular 
differentiation; and (iii) modulate cell survival. The 
isolated nucleic acid molecules of the invention can be 
used to express proteins (e.g., via a recombinant 
expression vector in a host cell in gene therapy 
applications), to detect mRNA (e.g., in a biological 
sample) or a genetic lesion, and to modulate activity of 
a polypeptide of the invention. In addition, the 
polypeptides of the invention can be used to screen drugs 
or compounds which modulate activity or expression of a 
polypeptide of the invention as well as to treat 
disorders characterized by insufficient or excessive 
production of a protein of the invention or production 
of a form of a protein of the invention which has 
decreased or aberrant activity compared to the wild type 
protein. In addition, the antibodies of the invention 
can be used to detect and isolate a protein of the and 
modulate activity of a protein of the invention. 

This invention further pertains to novel agents 
identified by the above-described screening assays and 
uses thereof for treatments as described herein. 
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invention, or a biologically active portion thereof, on 
the cell surface with a known compound which binds the 
polypeptide to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the 
5 ability of the test compound to interact with the 

polypeptide, wherein determining the ability of the test 
compound to interact with the polypeptide comprises 
determining the ability of the test compound to 
preferentially bind to the polypeptide or a biologically 
10 active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell -based assay 
comprising contacting a cell expressing a membrane -bound 
form of a polypeptide of the invention, or a biologically 
active portion thereof, on the cell surface with a test 

15 compound and determining the ability of the test compound 
to modulate (e.g., stimulate or inhibit) the activity of 
the polypeptide or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of the polypeptide or a biologically active 

20 portion thereof can be accomplished, for example, by 
determining the ability of the polypeptide protein to 
bind to or interact with a target molecule. 

Determining the ability of a polypeptide of the 
invention to bind to or interact with a target molecule 

25 can be accomplished by one of the methods described above 
for determining direct binding. As used herein, a 
"target molecule" is a molecule with which a selected 
polypeptide (e.g., a polypeptide of the invention binds 
or interacts with in nature, for example, a molecule on 

30 the surface of a cell which expresses the selected 

protein, a molecule on the surface of a second cell, a 
molecule in the extracellular milieu, a molecule 
associaced with the internal surface of a cell membrane 
or a cytoplasmic molecule. A target molecule can be a 
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Libraries of compounds may be presented in solution 
(e.g., Houghten (1992) Bio/Techniques 13:412-421), or on 
beads (Lam (1991) Nature 354:82-84), chips (Fodor (1993) 
Nature 364:555-556), bacteria (U.S. Patent No. 
5 5,223,409), spores (Patent NOS . 5,571,698; 5,403,484; and 
5,223,409), plasmids (Cull et al . (1992) Proc . Natl. 
Acad. Sci. USA 89:1865-1869) or phage (Scott and Smith 
(1990) Science 249^386-390; Devlin (1990) Science 
249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci. 
10 USA 87:6378-6382; and Felici (1991) J. Mol . Biol. 
222 :301-310) . 

In one embodiment, an assay is a cell-based assay in 
which a cell which expresses a membrane -bound form of a 
polypeptide of the invention, or a biologically active 

15 portion thereof, on the cell surface is contacted with a 
test compound and the ability of the test compound to 
bind to the polypeptide determined. The cell, for 
example, can be a yeast cell or a cell of mammalian 
origin. Determining the ability of the test compound to 

20 bind to the polypeptide can be accomplished, for example, 
by coupling the test compound with a radioisotope or 
enzymatic label such that binding of the test compound to 
the polypeptide or biologically active portion thereof 
can be determined by detecting the labeled compound in a 

2 5 complex. For example, test compounds can be labeled with 

125 I, 35 S, U C, or 3 H, either directly or indirectly, and 
the radioisotope detected by direct counting of 
radioemmission or by scintillation counting. 
Alternatively, test compounds can be enzymatically 
30 labeled with, for example, horseradish peroxidase, 

alkaline phosphatase, or lucif erase, and the enzymatic 
label detected by determination of conversion of an 
appropriate substrate to product. In a preferred 
embodiment, the assay comprises contacting a cell which 

3 5 expresses a membrane -bound form of a polypeptide of the 
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polypeptide to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the 
ability cf the test compound to interact with the 
polypeptide, wherein determining the ability of the test 

5 compound to interact with the polypeptide comprises 
determining the ability of the test compound to 
preferentially bind to the polypeptide or biologically 
active portion thereof as compared to the known compound. 
In another embodiment, an assay is a cell -free assay 

0 comprising contacting a polypeptide of the invention or 
biologically active portion thereof with a test compound 
and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the 
polypeptide or biologically active portion thereof. 

5 Determining the ability of the test compound to modulate 
the activity of the polypeptide can be accomplished, for 
example, by determining the ability of the polypeptide to 
bind to a target molecule by one of the methods described 
above for determining direct binding. In an alternative 

0 embodiment, determining the ability of the test compound 
to modulate the activity of the polypeptide can be 
accomplished by determining the ability of the 
polypeptide of the invention to further modulate the 
target molecule. For example, the catalytic/enzymatic 

5 activity of the target molecule on an appropriate 
substrate can be determined as previously described. 

In yet another embodiment, the cell -free assay 
comprises contacting a polypeptide of the invention or 
biologically active portion thereof with a known compound 

0 which binds the polypeptide to form an assay mixture, 
contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with the polypeptide, wherein determining the ability of 
the test compound to interact with the polypeptide 

5 comprises determining the ability of the polypeptide to 
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polypeptide of the invention or some other polypeptide or 
protein. For example, a target molecule can be a 
component of a signal transduction pathway, which 
facilitates transduction of an extracellular signal 

5 (e.g., a signal generated by binding of a compound to a 
polypeptide of the invention) through the cell membrane 
and into the cell or a second intercellular protein which 
has catalytic activity or a protein which facilitates the 
association of downstream signaling molecules with a 

0 polypeptide of the invention. Determining the ability of 
a polypeptide of the invention to bind to or interact 
with a target molecule can be accomplished by determining 
the activity of the target molecule. For example, the 
activity of the target molecule can be determined by 

5 detecting induction of a cellular second messenger of the 
target (e.g., intracellular Ca 2 *, diacylglycerol , IP3, 
etc.), detecting catalytic/enzymatic activity of the 
target on an appropriate substrate, detecting the 
induction of a reporter gene (e.g., a regulatory element 

0 that is responsive to a polypeptide of the invention 
operably linked to a nucleic acid encoding a detectable 
marker, e.g. luciferase) , or detecting a cellular 
response, for example, cellular differentiation, or cell 
proliferation. 

5 In yet another embodiment, an assay of the present 
invention is a cell-free assay comprising contacting a 
polypeptide of the invention or biologically active 
portion thereof with a test compound and determining the 
ability of the test compound to bind to the polypeptide 

0 or biologically active portion thereof. Binding of the 
test compound to the polypeptide can be determined either 
directly or indirectly as described above. In a 
preferred embodiment, the assay includes contacting the 
polypeptide of the invention or biologically active 
5 portion thereof with a known compound which binds the 
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Chemical; St. Louis, MO) or glutathione derivatized 
microtitre plates, which are then combined with the test 
compound or the test compound and either the non-adsorbed 
target protein or A polypeptide of the invention, and the 

5 mixture incubated under conditions conducive to complex 
formation (e.g., at physiological conditions for salt and 
pH) . Following incubation, the beads or microtitre plate 
wells are washed to remove any unbound components and 
complex format ion is measured either directly or 

0 indirectly, for example, as described above. 

Alternatively, the complexes can be dissociated from the 
matrix, and the level of binding or activity of the 
polypeptide of the invention can be determined using 
standard techniques. 

5 Other -techniques for immobilizing proteins on matrices 
can also be used in the screening assays of the 
invention. For example, either the polypeptide of the 
invention or its target molecule can be immobilized 
utilizing conjugation of biotin and streptavidin. 

0 Biotinylated polypeptide of the invention or target 
molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques well known in the art 
(e.g., biotinylation kit, Pierce Chemicals; Rockford, 
IL) , and immobilized in the wells of streptavidin-coated 

5 96 well plates (Pierce Chemical) . Alternatively, 

antibodies reactive with the polypeptide of the Invention 
or target molecules but which do not interfere with 
binding of the polypeptide of the invention to its target 
molecule can be derivatized to the wells of the plate, 

0 and unbound target or polypeptidede of the invention 
trapped in the wells by antibody conjugation. Methods 
for detecting such complexes, in addition to those 
described above for the GST- immobilized complexes, 
include immunodetection of complexes using antibodies 

5 reactive with the polypeptide of the invention or target 
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preferentially bind to or modulate the activity of a 
target molecule. 

The cell -free assays of the present invention are 
amenable to use of both a soluble form or the membrane- 
5 bound form of a polypeptide of the invention. In the 
case of cell-free assays comprising the membrane -bound 
form of the polypeptide, it may be desirable to utilize a 
solubilizing agent such that the membrane -bound form of 
the polypeptide is maintained in solution. Examples of 

10 such solubilizing agents include non- ionic detergents 
such as n-octylglucoside, n-dodecylglucoside, n- 
dodecylmal toside , octanoyl -N-methylglucamide , decanoyl -N- 
methylglucamide, Triton X-100, Triton X-114, Thesit, 
Isotridecypoly (ethylene glycol ether)n, 3- [(3- 

15 cholamidopropyl) dimethylamminio] -1 -propane sulfonate 
(CHAPS) , 3- [ (3 -cholamidopropyl) dimethylamminio] -2- 
hydroxy-1- propane sulfonate (CHAPSO) , or N - dodecy 1 =N , N - 
dimethyl -3 -ammonio-1 -propane sulfonate . 

In more than one embodiment of the above assay methods 

20 of the present invention, it may be desirable to 

immobilize either the polypeptide of the invention or its 
target molecule to facilitate separation of complexed 
from uncomplexed forms of one or both of the proteins, as 
well as to accommodate automation of the assay. Binding 

25 of a test compound to the polypeptide, or interaction of 
the polypeptide with a target molecule in the presence 
and absence of a candidate compound, can be accomplished 
in any vessel suitable for containing the reactants. 
Examples of such vessels include microtitre plates, test 

3 0 tubes, and micro-centrifuge tubes. In one embodiment, a 
fusion protein can be provided which adds a domain that 
allows one or both of the proteins to be bound to a 
matrix. For example, glutathione-S-transf erase fusion 
proteins or glutathione-S-transf erase fusion proteins can 

3 5 be adsorbed onto glutathione sepharose beads (Sigma 
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No. WO 94/10300) , to identify other proteins, which bind 
to or interact with the polypeptide of the invention and 
modulate activity of the polypeptide of the invention. 
Such binding proteins are also likely to be involved in 
5 the propagation of signals by the polypeptide of the 
inventions as, for example, upstream or downstream 
elements of a signaling pathway involving the polypeptide 
of the invention. 

This invention further pertains to novel agents 
10 identified by the above -described screening assays and 
uses thereof for treatments as described herein. 

B . Detection Assays 

Portions or fragments of the cDNA sequences identified 
herein (and the corresponding complete gene sequences) 

15 can be used in numerous ways as polynucleotide reagents. 
For example, these sequences can be used to: (i) map 
their respective genes on a chromosome and, thus, locate 
gene regions associated with genetic disease; (ii) 
identify an individual from a minute biological sample 

20 (tissue typing) ; and (iii) aid in forensic identification 
of a biological sample. These applications are described 
in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a 
2 5 gene has been isolated, this sequence can be used to map 
the location of the gene on a chromosome. Accordingly, 
nucleic acid molecules described herein or fragments 
thereof, can be used to map the location of the 
corresponding genes on a chromosome. The mapping of the 
30 sequences to chromosomes is an important first step in 
correlating these sequences with genes associated with 
disease . 
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molecule, as well as enzyme- linked assays which rely on 
detecting an enzymatic activity associated with the 
polypeptide of the invention or target molecule. 

In another embodiment, modulators of expression of a 
5 polypeptide of the invention are identified in a method 
in which a cell is contacted with a candidate compound 
and the expression of the selected mRNA or protein (i.e., 
the mRNA or protein corresponding to a polypeptide or 
nucleic acid of the invention) in the cell is determined. 

10 The level of expression of the selected mRNA or protein 
in the presence of the candidate compound is compared to 
the level of expression of the selected mRNA or protein 
in the absence of the candidate compound. The candidate 
compound can then be identified. as a modulator of 

15 expression of the polypeptide of the invention based on 
this comparison. For example, when expression of the 
selected mRNA or protein is greater (statistically 
significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is 

20 identified as a stimulator of the selected mRNA or 

protein expression. Alternatively, when expression of 
the selected mRNA or protein is less (statistically 
significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is 

25 identified as an inhibitor of the selected mRNA or 

protein expression. the level of the selected mRNA or 
protein expression in the cells can be determined by 
methods described herein. 

In yet another aspect of the invention, a polypeptide 

30 of the inventions can be used as "bait proteins" in a 
two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al . (1993) Cell 72:223- 
232; Madura et al . (1993) J". Biol. Che/n. 268:12046-12054; 
Bartel et al . (1993) Bio/Techniques 14:920-924; Iwabuchi 

35 ec al . (1993) Oncogene 8:1693-1696; and PCT Publication 
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marking multiple sites and/or multiple chromosomes. 
Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene 
5 families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic 

10 map data. (Such data are found, for example, in V. 

McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 
same chromosomal region, can then be identified through 

15 linkage analysis (co- inheritance of physically adjacent 
genes), described in, e.g., Egeland et al . (1987) Nature 
325:783-787. 

Moreover, differences in the DNA sequences between 
individuals affected and unaffected with a disease 
20 associated with a gene of the invention can be 

determined. If a mutation is observed in some or all of 
the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the 
causative agent of the particular disease. Comparison of 

2 5 affected and unaffected individuals generally involves 

first looking for structural alterations in the 
chromosomes such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR 
based on that DNA sequence. Ultimately, complete 

3 0 sequencing of genes from several individuals can be 

performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 
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Briefly, genes can be mapped to chromosomes by 
preparing PCR primers (preferably 15-25 bp in length) 
from the sequence of a gene of the invention. Computer 
analysis of the sequence of a gene of the invention can 
5 be used to rapidly select primers that do not span more 
than one exon in the genomic DNA, thus complicating the 
amplification process. These primers can then be used 
for PCR screening of somatic cell hybrids containing 
individual human chromosomes. Only those hybrids 

10 containing the human gene corresponding to the gene 

sequences will yield an amplified fragment. For a review 
of this technique, see D'Eustachio et al . ((1983) Science 
220:919-924) . 

PCR mapping of somatic cell hybrids is a rapid 

15 procedure for assigning a particular sequence to a 

particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using 
the nucleic acid sequences of the invention to design 
oligonucleotide primers, sublocalization can be achieved 

20 with panels of fragments from specific chromosomes. 

Other mapping strategies which can similarly be used to 
map a gene to its chromosome include in situ 
hybridization (described in Fan et al . (1990) Proc. Natl. 
Acad. Sci . USA 87:6223-27), pre-screening with labeled 

25 flow-sorted chromosomes (CITE) , and pre-selection by 
hybridization to chromosome specific cDNA libraries 
(CITE) . Fluorescence in situ hybridization (FISH) of a 
DNA sequence to a metaphase chromosomal spread can 
further be used to provide a precise chromosomal location 

30 in one step. For a review of this technique, see Verma 
en al . , (Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used 
individually to mark a single chromosome or a single site 

35 on that chromosome, or panels of reagents can be used for 
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individual humans occurs with a frequency of about once 
per each 500 bases. Each of the sequences described 
herein can, to some degree, be used as a standard against 
which DNA from an individual can be compared for 
5 identification purposes. Because greater numbers of 
polymorphisms occur in the noncoding regions, fewer 
sequences are necessary to differentiate individuals. 
The noncoding sequences of SEQ ID NO:l can comfortably 
provide positive individual identification with a panel 

10 of perhaps 10 to 1,000 primers which each yield a 

noncoding amplified sequence of 100 bases. If predicted 
coding sequences, such as those in SEQ ID NO: 3 are used, 
a more appropriate number of primers for positive 
individual identification would be 500-2,000. 

15 If a panel of reagents from the nucleic acid sequences 
described herein is used to generate a unique 
identification database for an individual, those same 
reagents can later be used to identify tissue from that 
individual. Using the unique identification database, 

20 positive identification of the individual, living or 
dead, can be made from extremely small tissue samples. 

3 . Use of Partial Gene Sequences in Forensic Biology 
DNA-based identification techniques can also be used in 
forensic biology. Forensic biology is a scientific field 

25 employing genetic typing of biological evidence found a.t 
a crime scene as a means for positively identifying, for 
example, a perpetrator of a crime. To make such an 
identification, PGR technology can be used to amplify DNA 
sequences taken from very small biological samples such 

30 as tissues, e.g., hair or skin, or body fluids, e.g., 
blood, saliva, or semen found at a crime scene. The 
amplified sequence can then be compared to a standard, 
thereby allowing identification of the origin of the 
biological sample. 
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2. Tissue Typing 

The nucleic acid sequences of the present invention can 
also be used to identify individuals from minute 
biological samples. The United States military, for 
5 example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its 
personnel. In this technique, an individual's genomic 
DNA is digested with one or more restriction enzymes, and 
probed on a Southern blot to yield unique bands for 

10 identification. This method does not suffer from the 
current limitations of "Dog Tags" which can be lost, 
switched, or stolen, making positive identification 
difficult. The sequences of the present invention. are 
useful as additional DNA markers for RFLP (described in 

15 U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can 
be used to provide an alternative technique which 
determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 

20 nucleic acid sequences described herein can be used to 
prepare two PCR primers from the 5' and 3' ends of the 
sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, 

25 prepared in this manner, can provide unique individual 
identifications, as each individual will have a unique 
set of such DNA sequences due to allelic differences. 
The sequences of the present invention can be used to 
obtain such identification sequences from individuals and 

30 from tissue. The nucleic acid sequences of the invention 
uniquely represent portions of the human genome. Allelic 
variation occurs to some degree in the coding regions of 
these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between 
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to diagnostic assays for determining expression of a 
polypeptide or nucleic acid of the invention and/or 
activity of a polypeptide of the invention, in the 
context of a biological sample (e.g., blood, serum, 
5 cells, tissue) to thereby determine whether an individual 
is afflicted with a disease or disorder, or is at risk of 
developing a disorder, associated with aberrant 
expression or activity of a polypeptide of the invention. 
The invention also provides for prognostic (or 

10 predictive) assays for determining whether an individual 
is at risk of developing a disorder associated with 
aberrant expression or activity of a polypeptide of the 
invention. For example, mutations in a gene of the 
invention can be. assayed in a biological sample. Such 

15 assays can be used for prognostic or predictive purpose 
to thereby prophylactically treat an individual prior to 
the onset of a disorder characterized by or associated 
with aberrant expression or activity of a polypeptide of 
the invention. 

20 Another aspect of the invention provides methods for 
expression of a nucleic acid or polypeptide of the 
invention or activity of a polypeptide of the invention 
in an individual co thereby select appropriate 
therapeutic or prophylactic agents for that individual 

25 (referred to herein as "pharmacogenomics") . 

Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 
of an individual based on the genotype of the individual 
(e.g., the genotype of the individual examined to 

3 0 determine the ability of the individual to respond to a 
particular agent) . 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of a polypeptide 

35 of the invention in clinical trials. 
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The sequences of the present invention can be used to 
provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic 
5 identifications by, for example, providing another 

"identification marker" (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 
identification as an accurate alternative to patterns 

10 formed by restriction enzyme generated fragments. 

Sequences targeted to noncoding regions are particularly 
appropriate for this use as greater numbers of 
polymorphisms occur in the noncoding regions, making it 
easier to differentiate individuals using this technique. 

15 Examples of polynucleotide reagents include the nucleic 
acid sequences of the invention or portions thereof, 
e.g., fragments derived from noncoding regions having a 
length of at least 20 or 30 bases. 

The nucleic acid sequences described herein can further 

20 be used to provide polynucleotide reagents, e.g., labeled 
or labelable probes which can be used in, for example, an 
in situ hybridization technique, to identify a specific 
tissue, e.g., brain tissue. This can be very useful in 
cases where a forensic pathologist is presented with a 

25 tissue of unknown origin. Panels of such probes can be 
used to identify tissue by species and/or by organ type. 


C . Predictive Medicine 

The present invention also pertains to the field of 
predictive medicine in which diagnostic assays, 
3 0 prognostic assays, pharmacogenomics , and monitoring 
clinical trails are used for prognostic (predictive) 
purposes to thereby treat an individual prophylactically . 
Accordingly, one aspect of the present invention relates 
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indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples 
of indirect labeling include detection of a primary 
antibody using a f luorescently labeled secondary antibody 
5 and end- labeling of a DNA probe with biotin such that it 
can be detected with f luorescently labeled streptavidin. 
The term "biological sample" is intended to include 
tissues, cells and biological fluids isolated from a 
subject, as well as tissues, cells and fluids present 

10 within a subject. That is, the detection method of the 
invention can be used to detect mRNA, protein, or genomic 
DNA in a biological sample in vitro as well as in vivo. 
For example, in vitro techniques for detection of mRNA 
include Northern hybridizations and in situ 

15 hybridizations. In vitro techniques for detection of A 
polypeptide of the invention include enzyme linked 
immunosorbent assays (ELISAs) , Western blots, 
immunoprecipitations and immunofluorescence. In vitro 
techniques for detection of genomic DNA include Southern 

20 hybridizations. Furthermore, in vivo techniques for 
detection of a polypeptide of the invention include 
introducing into a subject a labeled antibody directed 
against the polypeptide. For example, the antibody can 
be labeled with a radioactive marker whose presence and 

25 location in a subject can be detected by standard imaging 
techniques. 

In one embodiment, the biological sample contains 
protein molecules from the test subject. Alternatively, 
the biological sample can contain mRNA molecules from the 
3 0 test subject or genomic DNA molecules from the test 

subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means 
from a subject . 

In another embodiment, the methods further involve 
35 obtaining a control biological sample from a control 
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These and other agents are described in further detail 
in the following sections. 

1 . Diagnostic Assays 

An exemplary method for detecting the presence or 
5 absence of a polypeptide or nucleic acid of the invention 
in a biological sample involves obtaining a biological 
sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting a 
polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of 

10 the invention such that the presence of a polypeptide or 
nucleic acid of the invention is detected in the 
biological sample. A preferred agent for detecting mRNA 
or genomic DNA encoding a polypeptide of the invention is 
a labeled nucleic acid probe capable of hybridizing to 

15 mRNA or genomic DNA encoding a polypeptide of the 

invention. The nucleic acid probe can be, for example, a 
full-length cDNA, such as the nucleic acid of SEQ ID N0:1 
or 4, or a portion thereof, such as an oligonucleotide of 
at least 15, 30, 50, 100, 250 or 500 nucleotides in 

20 length and sufficient to specifically hybridize under 

stringent conditions to a mRNA or genomic DNA encoding a 
polypeptide of the invention. Other suitable probes for 
use in the diagnostic assays of the invention are 
described herein. 

25 A preferred agent for detecting A polypeptide of the 
invention is an antibody capable of binding to A 
polypeptide of the invention, preferably an antibody with 
a detectable label. Antibodies can be polyclonal, or 
more preferably, monoclonal. An intact antibody, or a 

3 0 fragment thereof (e.g., Fab or F(ab') 2 ) can be used. The 
term "labeled", with regard to the probe or antibody, is 
intended to encompass direct labeling of the probe or 
antibody by coupling (i.e., physically linking) a 
detectable substance to the probe or antibody, as well as 
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For oligonucleotide-based kits, the kit may comprise, 
for example: (1) an oligonucleotide, e.g., a detectably 
labeled oligonucleotide, which hybridizes to a nucleic 
' acid sequence encoding a polypeptide of the invention or 
5 (2) a pair of primers useful for amplifying a nucleic 
acid molecule encoding a polypeptide of the invention. 

The kit may also comprise, e.g., a buffering agent, a 
preservative, or a protein stabilizing agent. The kit 
may also comprise components necessary for detecting the 
10 detectable agent (e.g., an enzyme or a substrate). The 
kit may also contain a control sample or a series of 
control samples which can be assayed and compared to the 
test sample contained. Each component of the kit is 
usually enclosed within an individual container and all 
15 of the various containers are within a single package 
along with instructions for observing whether the tested 
subject is suffering from or is at risk of developing a 
disorder associated with aberrant expression of the 
polypeptide. 

20 2 . Prognostic Assays 

The methods described herein can furthermore be 
utilized as diagnostic or prognostic assays to identify 
subjects having or at risk of developing a disease or 
disorder associated with aberrant expression or activity 

25 of a polypeptide of the invention. For example, the 

assays described herein, such as the preceding diagnostic 
assays or the following assays, can be utilized to 
identify a subject having or at risk of developing a 
disorder associated with aberrant expression or activity 

30 of a polypeptide of the invention. Alternatively, the 
prognostic assays can be utilized to identify a subject 
having or at risk for developing such a disease or 
disorder. Thus, the present invention provides a method 
in which a test sample is obtained from a subject and a 
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subject, contacting the control sample with a compound or 
agent capable of detecting a polypeptide of the invention 
or mRNA or genomic DNA encoding a polypeptide of the 
invention, such that the presence of the polypeptide or 
5 mRNA or genomic DNA encoding the polypeptide is detected 
in the biological sample, and comparing the presence of 
the polypeptide or mRNA or genomic DNA encoding the 
polypeptide in the control sample with the presence of 
the polypeptide or mRNA or genomic DNA encoding the 

10 polypeptide in the test sample. 

The invention also encompasses kits for detecting the 
presence of a polypeptide or nucleic acid of the 
invention in a biological sample (a test sample) . Such 
kits can be used to determine if a subject is suffering 

15 from or is at increased risk of developing a disorder 
associated with aberrant expression of a polypeptide of 
the invention (e.g., an immunological disorder). For 
example, the kit can comprise a labeled compound or agent 
capable of detecting the polypeptide or mRNA encoding the 

2 0 polypeptide in a biological sample and means for 

determining the amount of the polypeptide or mRNA in the 
sample (e.g., an antibody which binds the polypeptide or 
an oligonucleotide probe which binds to DNA or mRNA 
encoding the polypeptide) . Kits may also include 

25 instruction for observing that the tested subject is 
suffering from or is at risk of developing a disorder 
associated with aberrant expression of the polypeptide if 
the amount of the polypeptide or mRNA encoding the 
polypeptide is above or below a normal level. 

30 For antibody-based kits, the kit may comprise, for 

example: (1) a first antibody (e.g., attached to a solid 
support) which binds to a polypeptide of the invention; 
and, optionally, (2) a second, different antibody which 
binds to either the polypeptide or the first antibody and 

35 is conjugated to a detectable agent. 
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In preferred embodiments, the methods include detecting, 
in a sample of cells from the subject, the presence or 
absence of a genetic lesion or mutation characterized by 
at least one of an alteration affecting the integrity of 

5 a gene encoding the polypeptide of the invention, or the 
mis -expression of the gene encoding the polypeptide of 
the invention. For example, such genetic lesions or 
mutations can be detected by ascertaining the existence 
of at least one of: Da deletion of one or more 

0 nucleotides from the gene; 2) an addition of one or more 
nucleotides to the gene; 3) a substitution of one or more 
nucleotides of the gene; 4) a chromosomal rearrangement 
of the gene; 5) an alteration in the level of a messenger 
RNA transcript of the gene; 6) an aberrant modification 

5 of the gene, such as of the methylation pattern of the 
genomic DNA; 7) the presence of a non-wild type splicing 
pattern of a messenger RNA transcript of the gene; 8) a 
non-wild type level of a the protein encoded by the gene; 
9) an allelic loss of the gene; and 10) an inappropriate 

0 post-translational modification of the protein encoded by 
the gene. As described herein, there are a large number 
of assay techniques known in the art which can be used 
for detecting lesions in a gene. 

In certain embodiments, detection of the lesion 

5 involves the use of a probe/primer in a polymerase chain 
reaction (PGR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, 
alternatively, in a ligation chain reaction (LCR) (see, 
e.g., Landegran et al . (1988) Science 241:1077-1080; and 

0 Nakazawa et al . (1994) Proc . Natl. Acad. Sci . USA 91:360- 
364) , the latter of which can be particularly useful for 
detecting point mutations in a gene (see, e.g., Abravaya 
et al. (1995) Nucleic Acids Res. 23:675-682). This 
method can include the steps of collecting a sample of 

5 cells from a patient, isolating nucleic acid (e.g., 
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polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of 
the invention is detected, wherein the presence of the 
polypeptide or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder 
5 associated with aberrant expression or activity of the 
polypeptide. As used herein, a "test sample" refers to a 
biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. 

10 Furthermore, the prognostic assays described herein can 
be used to determine whether a subject can be 
administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) to treat a disease or 

15 disorder associated with aberrant expression or activity 
of a polypeptide of the invention. For example, such 
methods can be used to determine whether a subject can be 
effectively treated with a specific agent or class of 
agents (e.g., agents of a type which decrease activity of 

2 0 the polypeptide) . Thus, the present invention provides 
methods for determining whether a subject can be 
effectively treated with an agent for a disorder 
associated with aberrant expression or activity of a 
polypeptide of the invention in which a test sample is 

2 5 obtained and the polypeptide or nucleic acid encoding the 

polypeptide is detected (e.g., wherein the presence of 
the polypeptide or nucleic acid is diagnostic for a 
subject that can be administered the agent to treat a 
disorder associated with aberrant expression or activity 

3 0 of the polypeptide) . 

The methods of the invention can also be used to detect 
genetic lesions or mutations in a gene of the invention, 
thereby determining if a subject with the lesioned gene 
is at risk for a disorder characterized aberrant 
35 expression or activity of a polypeptide of the invention. 
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specific mutations by development or loss of a ribozyme 
cleavage site. 

In other embodiments, genetic mutations can be 
identified by hybridizing a sample and control nucleic 

5 acids, e.g., DNA or RNA, to high density arrays 

containing hundreds or thousands of oligonucleotides 
probes (Cronin et al . (1996) Human Mutation 7:244-255; 
Kozal et al . (1996) Nature Medicine 2:753-759). For 
example, genetic mutations can be identified in two- 

0 dimensional arrays containing light -generated DNA probes 
as described in Cronin et al . , supra. Briefly, a first 
hybridization array of probes can be used to scan through 
long stretches of DNA in a sample and control to identify 
base changes between the sequences by making linear 

5 arrays of sequential overlapping probes. This step 

allows the identification of point mutations. This step 
is followed by a second hybridization array that allows 
the characterization of specific mutations by using 
smaller, specialized probe arrays complementary to all 

0 variants or mutations detected. Each mutation array is 
composed of parallel probe sets, one complementary to the 
wild- type gene and the other complementary to the mutant 
gene . 

In yet another embodiment, any of a variety of 
5 sequencing reactions known in the art can be used to 
directly sequence the selected gene and detect mutations 
by comparing the sequence of the sample nucleic acids 
with the corresponding wild- type (control) sequence. 
Examples of sequencing reactions include those based on 
0 techniques developed by Maxim and Gilbert ((1977) Proc. 
Natl. Acad. Sci . USA 74:560) or Sanger ((1977) Proc. 
Natl. Acad. Sci. USA 74:5463). It is also contemplated 
that any of a variety of automated sequencing procedures 
can be utilized when performing the diagnostic assays 
5 ((1995) Bio/Techniques 19:448), including sequencing by 
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genomic, mRNA or both) from the cells of the sample, 
contacting the nucleic acid sample with one or more 
primers which specifically hybridize to the selected gene 
under conditions such that hybridization and 
5 amplification of the gene (if present) occurs, and 
detecting the presence or absence of an amplification 
product, or detecting the size of the amplification 
product and comparing the length to a control sample. It 
is anticipated that PGR and/or LCR may be desirable to 

10 use as a preliminary amplification step in conjunction 
with any of the techniques used for detecting mutations 
described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al . (1990) 

15 Proc. Natl. Acad. Sci . USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al . (1989) Proc. Natl. 
Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al. (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 

20 detection of the amplified molecules using techniques 

well known to those of skill in the art. These detection 
schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in 
very low numbers. 

25 In an alternative embodiment, mutations in a selected 
gene from a sample cell can be identified by alterations 
in restriction enzyme cleavage patterns. For example, 
sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 

30 endonucleases, and fragment length sizes are determined 
by gel electrophoresis and compared. Differences in 
fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 

3 5 No. 5,4 98,531) can be used to score for the presence of 
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E. coli cleaves A at G/A mismatches and the thymidine DNA 
glycosylase from HeLa cells cleaves T at G/T mismatches 
(Hsu et al. (1994) Carcinogenesis 15:1657-1662). 
According to an exemplary embodiment, a probe based on a 

5 selected sequence, e.g., a wild-type sequence, is 

hybridized to a cDNA or other DNA product from a test 
cell(s) . The duplex is treated with a DNA mismatch 
repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. 

0 See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic 
mobility will be used to identify mutations in genes. 
For example, single strand conformation polymorphism 
(SSCP) may be used to detect differences in 

5 electrophoretic mobility between mutant and wild type 
nucleic acids (Orita et al . (1989) Proc. Natl. Acad. Sci. 
USA 86:2766; see also Cotton (1993) Mutat. Res. 285:125- 
144; Hayashi (1992) Genet. Anal. Tech. Appl . 9:73-79). 
Single -stranded DNA fragments of sample and control 

0 nucleic acids will be denatured and allowed to renature . 
The secondary structure of single-stranded nucleic acids 
varies according to sequence, and the resulting 
alteration in electrophoretic mobility enables the 
detection of even a single base change. The DNA 

5 fragments may be labeled or detected with labeled probes. 
The sensitivity of the assay may be enhanced by using RNA 
(rather than DNA) , in which the secondary structure is 
more sensitive to a change in sequence. In a preferred 
embodiment, the subject method utilizes heteroduplex 

0 analysis to separate double stranded heteroduplex 
molecules on the basis of changes in electrophoretic 
mobility (Keen et al . (1991) Trends Genet. 7:5). 

In yet another embodiment, the movement of mutant or 
wild- type fragments in polyacrylamide gels containing a 

5 gradient of denaturant is assayed using denaturing 
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mass spectrometry (see, e.g., PCT Publication No. WO 
94/16101; Cohen et al . (1996) Adv. Chromatogr. 36:127- 
162/ and Griffin et al . (1993) Appl . Biochem. Biotechnol . 
38:147-159) . 

5 Other methods for detecting mutations in a selected 
gene include methods in which protection from cleavage 
agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al . (1985) Science 
230:1242). In general, the technique of "mismatch 

10 cleavage" entails providing heteroduplexes formed by 

hybridizing (labeled) RNA or DNA containing the wild- type 
sequence with potentially mutant RNA or DNA obtained from 
a tissue sample. The double -stranded duplexes are 
treated with an agent which cleaves single- stranded 

15 regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample 
scrands. RNA/DNA duplexes can be treated with RNase to 
digest mismatched regions, and DNA/DNA hybrids can be 
treated with SI nuclease to digest mismatched regions. 

20 In other embodiments, either DNA/DNA or RNA/DNA duplexes 
can be treated with hydroxylamine or osmium tetroxide and 
with piperidine in order to digest mismatched regions. 
After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing 

25 polyacrylamide gels to determine the site of mutation. 
See, e.g., Cotton et al . (1988) Proc. Natl. Acad. Sci . 
USA 85:43 97; Saleeba et al . (1992) Methods Enzymol . 
217:286-295. In a preferred embodiment, the control DNA 
or RNA can be labeled for detection. 

30 In still another embodiment, the mismatch cleavage 
reaction employs one or more proteins that recognize 
mismatched base pairs in double-stranded DNA (so called 
"DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in cDNAs obtained 

35 from samples of cells. For example, the mutY enzyme of 
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gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495) . When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a *GC clamp of 
5 approximately 40 bp of high-melting GC-rich DNA by PGR. 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys . Chem. 265:12753). 

10 Examples of other techniques for detecting point 
mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 

15 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al . (1989) Proc. Natl. Acad. 
Sci. USA 86 : 6230) . Such allele specific oligonucleotides 

20 are hybridized to PGR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

Alternatively, allele specific amplification technology 

25 which depends on selective PGR amplification may be used 
in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 

30 differential hybridization) (Gibbs et al . (1989) Nucleic 
Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent or reduce polymerase extension (Prossner (1993) 
Tibtech 11:238). In addition, it may be desirable to 

35 introduce a novel restriction site in the region of the 
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mutation to create cleavage-based detection (Gasparini et 
al. (1992) Mol. Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 
5 (1991) Proc. Natl. Acad. Sci. USA 88:189). In such 
cases, ligation will occur only if there is a perfect 
match a: the 3' end of the 5' sequence making it possible 
to detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

10 amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 

15 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a gene encoding a polypeptide of the 
invention. 

Furthermore, any cell type or tissue, preferably 
20 peripheral blood leukocytes, in which the polypeptide of 
the invention is expressed may be utilized in the 
prognostic assays described herein. 

3 . P h a r ma c oge nom i c s 

25 Agents, or modulators which have a stimulatory or 
inhibitory effect on activity or expression of a 
polypeptide of the invention as identified by a screening 
assay described herein can be administered to individuals 
to treat (prophylactically or therapeutically) disorders 

3 0 associated with aberrant activity of the polypeptide. In 
conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an 
individual's genotype and that individual's response to a 
foreign compound or drug) of the individual may be 

35 considered. Differences in metabolism of therapeutics 
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can lead to severe toxicity or therapeutic failure by- 
altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, 
the pharmacogenomics of the individual permits the 
5 selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
appropriate dosages and therapeutic regimens. 

10 Accordingly, the activity of a polypeptide of the 
invention, expression of a nucleic acid of the 
invention, or mutation content of a gene of the invention 
in an individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 

15 treatment of the individual. 

Pharmacogenomics deals with clinically significant 
hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 43 (2) :254- 

20 266. In general, two types of pharmacogenetic conditions 
can be differentiated. Genetic conditions transmitted as 
a single factor altering the way drugs act on the body 
are referred to as "altered drug action." Genetic 
conditions transmitted as single factors altering the way 

2 5 the body acts on drugs are referred to as "altered drug 
metabolism". These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, 
glucose-6-phosphate dehydrogenase deficiency (G6PD) is a 
common inherited enzymopathy in which the main clinical 

30 complication is haemolysis after ingestion of oxidant 
drugs (anti-malarials, sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug 
metabolizing enzymes is a major determinant of both the 

35 intensity and duration of drug action. The discovery of 
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genetic polymorphisms of drug metabolizing enzymes (e.g., 
N-acetyl transferase 2 (NAT 2) and cytochrome P450 enzymes 
CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or 
5 show exaggerated drug response and serious toxicity after 
taking the standard and safe dose of a drug. These 
polymorphisms are expressed in two phenotypes in the 
population, the extensive metabolizer (EM) and poor 
metabolizer (PM) . The prevalence of PM is different 

10 among different populacions. For example, the gene 
coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 

15 exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active 
therapeutic moiety, a PM will show no therapeutic 
response, as demonstrated for the analgesic effect of 
codeine mediated by its CYP2D6-f ormed metabolite 

20 morphine. The other extreme are the so called ultra- 
rapid metabolizers who do not respond to standard doses. 
Recently, the molecular basis of ultra-rapid metabolism 
has been identified to be due to CYP2D6 gene 
amplification. 

25 Thus, the activity of a polypeptide of the invention, 
expression of a nucleic acid encoding the polypeptide, .or 
mutation content of a gene encoding the polypeptide in an 
individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 

30 treatment of the individual. In addition, 

pharmacogenetic studies can be used to apply genotyping 
of polymorphic alleles encoding drug-metabolizing enzymes 
to the identification of an individual's drug 
responsiveness phenccype. This knowledge, when applied 

35 to dosing or drug selection, can avoid adverse reactions 
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or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a 
modulator of activity or expression of the polypeptide, 
such as a modulator identified by one of the exemplary 
5 screening assays described herein. 

4 . Monitoring of Effects During Clinical Trials 
Monitoring the influence of agents (e.g., drugs, 
compounds) on the expression or activity of a polypeptide 
of the invention (e.g., the ability to modulate aberrant 

10 cell proliferation and/or differentiation) can be applied 
not only in basic drug screening, but also in clinical 
trials. For example, the effectiveness of an agent, as 
determined by a screening assay as described herein, to 
increase gene expression, protein levels or protein 

15 activity, can be monitored in clinical trials of subjects 
exhibiting decreased gene expression, protein levels, or 
protein activity. Alternatively, the effectiveness of an 
agent, as determined by a screening assay, to decrease 
gene expression, protein levels or protein activity, can 

20 be monitored in clinical trials of subjects exhibiting 
increased gene expression, protein levels, or protein 
activity. In such clinical trials, expression or 
activity of a polypeptide of the invention and 
preferably, that of other polypeptide that have been 

25 implicated in for example, a cellular proliferation 
disorder, can be used as a marker of the immune 
responsiveness of a particular cell. 

For example, and not by way of limitation, genes, 
including those of the invention, that are modulated in 

30 cells by treatment with an agent (e.g., compound, drug or 
small molecule) which modulates activity or expression of 
a polypeptide of the invention (e.g., as identified in a 
screening assay described herein) can be identified. 
Thus, to study the effect of agents on cellular 
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proliferation disorders, for example, in a clinical 
trial, cells can be isolated and RNA prepared and 
analyzed for the levels of expression of a gene of the 
invention and other genes implicated in the disorder. 
5 The levels of gene expression (i.e., a gene expression 
pattern) can be quantified by Northern blot analysis or 
RT-PCR, as described herein, or alternatively by 
measuring the amount of protein produced, by one of the 
methods as described herein, or by measuring the levels 

10 of activity of a gene of the invention or other genes. 
In this way, the gene expression pattern can serve as a 
marker, indicative of the physiological response of the 
cells to the agent. Accordingly, this response state may 
be determined before, and at various points during, 

15 treatment of the individual with the agent. 

In a preferred embodiment, the present invention 
provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 

20 acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 
steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of the polypeptide or nucleic acid of 

25 the invention in the preadministration sample; (iii) 
obtaining one or more post -administration samples from 
the subject; (iv) detecting the level the of the 
polypeptide or nucleic acid of the invention in the post- 
administration samples; (v) comparing the level of the 

3 0 polypeptide or nucleic acid of the invention in the pre- 
administration sample with the level of the polypeptide 
or nucleic acid of the invention in the post- 
administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. 

3 5 For example, increased administration of the agent may be 
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desirable to increase the expression or activity of the 
polypeptide to higher levels than detected, i.e., to 
increase the effectiveness of the agent.. Alternatively, 
decreased administration of the agent may be desirable to 
5 decrease expression or activity of the polypeptide to 
lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 

C. Methods of Treatment 

The present invention provides for both prophylactic 
10 and therapeutic methods of treating a subject at risk of 
(or susceptible to) a disorder or having a disorder 
associated with aberrant expression or activity of a 
polypeptide of the invention. 

1 . Prophylactic Methods 

15 In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 
associated with an aberrant expression or activity of a 
polypeptide of the invention, by administering to the 
subject an agent which modulates expression or at least 

2 0 one activity of the polypeptide. Subjects at risk for a 
disease which is caused or contributed to by aberrant 
expression or activity of a polypeptide of the invention 
can be identified by, for example, any or a combination 
of diagnostic or prognostic assays as described herein. 

25 Administration of a prophylactic agent can -occur prior to 
the manifestation of symptoms characteristic of the 
aberrancy, such that a disease or disorder is prevented 
or, alternatively, delayed in its progression. Depending 
on the type of aberrancy, for example, an agonist or 

30 antagonist agent can be used for treating the subject. 
The appropriate agent can be determined based on 
screening assays described herein. 
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2 . Therapeutic Methods 

Another aspect: of the invention pertains to methods of 
modulating expression or activity of a polypeptide of the 
invention for therapeutic purposes. The modulatory 
5 method of the invention involves contacting a cell with 
an agent that modulates one or more of the activities of 
the polypeptide. An agent that modulates activity can be 
an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of the 

10 polypeptide, a peptide, a peptidomimet ic , or other small 
molecule. In one embodiment, the agent stimulates one or 
more of the biological activities of the polypeptide. 
Examples of such stimulatory agents include the active 
polypeptide of the invention and a nucleic acid molecule 

15 encoding the polypeptide of the invention that has been 
introduced into the cell. In another embodiment, the 
agent inhibits one or more of the biological activities 
of the polypeptide of the invention. Examples of such 
inhibitory agents include antisense nucleic acid 

20 molecules and antibodies. These modulatory methods can 
be performed in vitro (e.g., by culturing the cell with 
the agent) or, alternatively, in vivo (e.g, by 
administering the agent to a subject) . As such, the 
present invention provides methods of treating an 

25 individual afflicted with a disease or disorder 

characterized by aberrant expression or activity a 
polypeptide of the invention. In one embodiment, the 
method involves administering an agent (e.g., an agent 
identified by a screening assay described herein) , or 

30 combination of agents that modulates (e.g., upregulates 
or downregulates) expression or activity. In another 
embodiment, the method involves administering a 
polypeptide of the invention or a nucleic acid molecule 
of the invention as therapy to compensate for reduced or 

35 aberrant expression or activity of the polypeptide. 
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Stimulation of activity is desirable in situations in 
which activity or expression is abnormally low 
downregulated and/or in which increased activity is 
likely to have a beneficial effect. Conversely, 
5 inhibition of activity is desirable in situations in 
which activity or expression is abnormally high or 
upregulated and/or in which decreased activity is likely 
to have a beneficial effect. 

This invention is further illustrated by the following 
10 examples which should not be construed as limiting. The 
contents of all references, patents and published patent 
applications cited throughout this application are hereby 
incorporated by reference . 

Equivalents 

15 Those skilled in the art will recognize, or be able to 
ascertain using no more than routine experimentation, 
many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are 
intended to be encompassed by the following claims. 

20 What is claimed is: 
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1. An isolated nucleic acid molecule selected from 
the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide 
sequence which is at least 55% identical to the 
5 nucleotide sequence of SEQ ID NO:l, 3, 4, or 6, the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number , the cDNA insert of the plasmid deposited 

with ATCC as Accession Number , or a complement 

thereof ; 

10 b) a nucleic acid molecule comprising a fragment of 
at least 3 00 nucleotides of the nucleotide sequence of 
SEQ ID NO : 1 , 3, 4, or 6, the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , the 

cDNA insert of the plasmid deposited with ATCC as 

15 Accession Number , or a complement thereof; 

c) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with ATCC as Accession Number 

20 , or the amino acid sequence of the cDNA insert of 

the plasmid deposited with ATCC as Accession Number 

d) a nucleic acid molecule which encodes a fragment 
of a polypeptide comprising the amino acid sequence of 

2 5 SEQ ID NO: 2 or 5, the polypeptide encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number , or the polypeptide encoded by the 

cDNA insert of the plasmid deposited with ATCC as 
Accession Number , wherein the fragment comprises 

30 at least 15 contiguous amino acids of SEQ ID NO: 2 or 5, 
the polypeptide encoded by the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , or 

the polypeptide encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number and 
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e) a nucleic acid molecule which encodes a. naturally- 
occurring allelic variant of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2 or 5, the amino acid 
sequence encoded by the cDNA insert of the plasmid 

5 deposited with ATCC as Accession Number , or the 

amino acid sequence encoded by the cDNA insert of the 

plasmid deposited with ATCC as Accession Number , 

wherein the nucleic acid molecule hybridizes to a nucleic 
acid molecule comprising SEQ ID NO: 3 or 6 or a complement 
10 thereof under stringent conditions. 

2. The isolated nucleic acid molecule of claim 1, 
which is selected from the group consisting of: 

a) a nucleic acid comprising the nucleotide sequence 
of SEQ ID N0:1, 3, 4, or 5, the cDNA insert of the 

15 plasmid deposited with ATCC as Accession Number , 

the cDNA insert of the plasmid deposited with ATCC as 
Accession Number , or a complement thereof; and 

b) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 

2 0 NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number ' , or the amino acid sequence encoded 

by the cDNA insert of the plasmid deposited with ATCC as 
Accession Number . 


25 3. The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. 

4. The nucleic acid molecule of claim 1 further 
comprising nucleic acid sequences encoding a heterologous 
polypeptide. 

30 5. A host cell which contains the nucleic acid 
molecule of claim 1. 


WO 00/18800 


PCT/US99/228I8 


- 102 - 

6. The host cell of claim 5 which is a mammalian host 
cell. 

7. A non-human mammalian host cell containing the 
nucleic acid molecule of claim 1. 

8. An isolated polypeptide selected from the group 
consisting of: 

a) a fragment of a polypeptide comprising the amino 
acid sequence of SEQ ID NO: 2 or 5, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 
NO: 2 or 5; 

b) a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number , or the amino acid sequence encoded by 

the cDNA insert of the plasmid deposited with ATCC as 

Accession Number , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID NO: 1 or 4 or a 
complement thereof under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid 
molecule comprising a nucleotide sequence which is at 
least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:l or 4 or a complement 
thereof . 

9. The isolated polypeptide of claim 8 comprising the 
amino acid sequence of SEQ ID NO: 2 or 5, the amino acid 
sequence encoded by the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , or 

the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number . 
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10. The polypeptide of claim 8 further comprising 
heterologous amino acid sequences. 

11. An antibody which selectively binds to a 
polypeptide of claim 8. 

5 12 . A method for producing a polypeptide selected from 
the group consisting of: 

a) a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 2 or 5, the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 

10 Accession Number , or the amino acid sequence 

encoded by the cDNA insert of the plasmid deposited with 
ATCC as Accession Number j 

b) a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 2 or 5; the amino acid 

15 sequence encoded by the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , or 

the amino acid sequence encoded by the cDNA insert of the 

o 

plasmid deposited with ATCC as Accession Number , 

wherein the fragment comprises at least 15 contiguous 

20 amino acids of SEQ ID NO: 2 or 5, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with 

ATCC as Accession Number or the amino acid 

sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number / and 

25 c) a naturally occurring allelic variant of a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 
Number or the amino acid sequence encoded by 

30 the cDNA insert of the plasmid deposited with ATCC as 

Accession Number , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 
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nucleic acid molecule comprising SEQ ID NO: 1 or 4 or a 
complement thereof under stringent conditions; 

comprising culturing the host cell of claim 5 under 
conditions in which the nucleic acid molecule is 
5 expressed. 

13. A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 8; and 

10 b) determining whether the compound binds to the 
polypeptide in the sample. 

14 . The method of claim 13 , wherein the compound which 
binds to the polypeptide is an antibody. 

15. A kit comprising a compound which selectively 
15 binds to a polypeptide of claim 8 and instructions for 

use . 

16. A method for detecting the presence of a nucleic 
acid molecule of claim 1 in a sample, comprising the 
steps of : 

20 a) contacting the sample with a nucleic acid probe or 
primer which selectively hybridizes to the nucleic acid 
molecule; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

25 17. The method of claim 16, wherein the sample 

comprises mRNA molecules and is contacted with a nucleic 
acid probe. 
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18. A kit comprising a compound which selectively 
hybridizes to a nucleic acid molecule of claim 1 and 
instructions for use. 


19. A method for identifying a compound which binds to 
5 a polypeptide of claim 8 comprising the steps of: 

a) contacting a polypeptide, or a cell expressing a 
polypeptide of claim 8 with a test compound; and 

b) determining whether the polypeptide binds to the 
test compound. 

10 20. The method of claim 19, wherein the binding of the 

test compound to the polypeptide is detected by a method 

selected from the group consisting of: 

a) detection of binding by direct detecting of test 

compound/polypeptide binding; 
15 b) detection of binding using a competition binding 

assay; 

c) detection of binding using an assay for TANGO 191 
or TANGO 195-mediated signal transduction. 

21. A method for modulating the activity of a 
20 polypeptide of claim 8 comprising contacting a 

polypeptide or a cell expressing a polypeptide of claim 8 
with a compound which binds to the polypeptide in a 
sufficient concentration to modulate the activity of the 
polypeptide . 

25 22. A method for identifying a ^compound which 
modulates the activity of a polypeptide of claim 8, 
comprising : 

a) contacting a polypeptide of claim 8 with a test 
compound ; and 
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b) determining the effect of the test compound on the 
activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 
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• gtcgACCCACGCS-CCGCAGAGAAGAGTTTGAGATGCTTCTTCTTCAGAGCACTTCCTAC 

' 7 aaaGAGGTATCTCTCTGGATAGGAAGAAATATAGTAGAACCCTTTGAAAATGGATATT 

- ° : --CACATATT'TCGTTCPJGATACAAAAGCTGGCAGTTACTGAAATAAGGACTTGAAGrrC 
: S ^ --^-TCC-C— ^~*r~~ A^G-CTTAAGAGCAGGAAATAAAGAGACAGCTGAAGGTGTAGCCT 

> t -GACCAACTGAAAGGGAAATCTTCATCCTCTGAAAAAACATATGTGATTCTCAAAAAACG 

3 C ' C ATC-GGAAAATTGATAAAGAAGCGATTCTGTAGATTCTCCCAGCGCTGTTGGGCTCTCA 

- C T - TT r- c -T. c -r GT GAAGGACAACATATGGTGATGGGGAAATCAGAAGCTTTGAGACCCTCTA 

r " rr^GGATA T GAATCCCCCTTCTAATACTTACCAGAAATGAAGGGGATACTCAGGGCAG 

J" , " „;^ GAAT c-CAAAACACTCTACTCTGGCAAAGGAATGAAGTTATTGGAGTGATGACA 

54l GGAACACGGGAGAACAATGCTCTGTTTGGGCTGGATATTTCTTTGGCTTGTTGCAGGAGA 
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60 ^ GCGAATTAAAGGATTTAATATTTCAGG7TGTTCCACAAAAAAACTCCTTTGGACATATTC 
,Ttfr:rNlSGCSTKKLLW.-S 

S 5 ^ TACAAGGAGTGAAGAGGAATTTGTCTTATTTTGTGATTTACCAGAGCCACAGAAATCACA 

36 TRSESEFVLFCDLPEPQKSr. 

. ^ TCTGCCACAGAAA TCGACTCTCACCAAAACAAGTCCCTGAGCACCTGCCCTTCATGGG 

55 "? C H R N R o S P K Q V ? E H - ? r H G 

7 3 1 r AGTA ACGACCTATCTGATGTCCAATGGTACGAACAACCTTCGAATGGAGATCCATTAGA 
76*SMDLSDVQWYQQPSNGD?L=. 

841 GGACATTAGGAAAAGCTATCCTCACATCATTCAGGACAAATGTACCCTTCACTTTTTGAC 
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901 CCCAGGGGTGAATAATTCTGGGTCATATATTTGTAGACCCAAGATGATTAAGA^ 

061 TGATGTAGCCTGTTGTGTCAAGATGATTTTAGAAGTTAAGCCCCAGACAAA7GCATCCTG 
VACCVKMI^E'/KPQ - M b <- 


136 0 


• 0 - ■ "GAGTATTCCGCATCACATAAGCAAGACCTACTTCTTGGGAGCACTGGCTCTATTTCTTG 
"l55 sYSASHKQOLLLGS.GSiSL 

,081 CCCCAGTCTCAGCTGCCAAAGTGATTCACA 

? S L S C Q -S uaw 

li;i AAAACTCCTCTCTGTGGAAAGGAGCAACCGAATCGTAGTGGATGAACTTTATGACTATCA 
xLL SVERSNRIVVDEVYD.r. 

CCAGGGCACATATGTATGTGATTACAC7CAGTCGGATACTGTGAGTTCGTGGACAGTCAG 
QGTYVCOYTQSDTVSSW . V R 

\GCTGTTGTTCAAGTGAGAACCATTGTGGGAGACACTAAACTCAAACCAGATATTCTGGA 
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r ? * * ~ N ? - • b 


fig. i Oo*2) 


WO 00/18800 


2 / 12 


PCT/US99/22818 


• • 4 • cCTAGAGTGGGAAGTCTCAGTACCTSAGGCGAAAAGTATTAAATCCACTTTA^GGaTGa 

„i L £WEVS-V?SAKSIKSTL.n 

. - ^ C --TGAGCGTAATAT«ATCTTGGAAAAAGTCACTCAGCGTGATCTTCGCAGGAAGTT 
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1981 
476 


. -,-~ T - GC GTCC ^GAACTCCATTGGAAACACAACCCAGTCCGTCCAACTGAAAGAAAA 

"236 Tcr"vQNSIG»TTQSVQLXSS 

• 621 GAGAGGAGTGGTGCTCCTGTACATCCTGCTTGGCACCATCGGGACCCTGGTGGCCGTGCT 

"356 R G V V r. r v r r r, r T t r r, t r, V „ 

- 681 GGCGGCGAGTGCCCTCCTCTACAGGCAC7GGATTGAAA7AGTGCTGCTGTACCGGACCTA 

"!-- i a c a r, Y RHWXEIVLLVRT 

^41 CCAGAGCAAGGATCAGACGCTTGGGGATAAAAAGGAT7TTGATGCTTTCG7ATCCTATGC 
" 39 6 QSKDQTLGDKKDrDAFVS... 

a/ —r» rrrciC^C YrC"*CTGAGTGAAGAACACTTGGCC CT 

•9Qi ^ AA ATGGAGCTCTrrTCCAAG*G«GGv.CAL-.<-AH-.>-.^« _ .. . , . 

,361 GAGCCTATTTCCTGATGTT^AGAAAA 

192 ' AGATGTGGCTCCAGGAGGAGTGTATGCAGAAGACATTGTGAGCATTATTAAGAGAAGCAG 
DVAPGGVYAEDIVSIIKRSR 

AAGAGGAATATTTATCTTGAGCCCCAACTATGTCAATGGACCCAGTATCTTTGAACTACA 
RGIFILSPNYVNGPSIFEUU 

- 10 1 TTACTTCC^AGCCAGACTCTC^ 

535 fVTWRGLKSvr 

556 aYHMPVKNSV^ 
„8I CTCTAGGA™g™ga™ 

2 3 41 ccct.aaggaatggtgaaatgagccctggagccccctccagtccagtccctgggatagaga 

"596 ? K E W * 

____ r -^, ra /-iArTCACAGCTCTGTGTGTGTGTGTTCAGGCTGATAGGAAATTCAAA 

2401 ^S^CC^ 

^gagcc^gaStccS^ 

252. • TAGAGCC ™,":^;p T ^ rA ^r C CXGTGGTCCGAGCAGAATCAGAAAATACAG 

^atactIgcI^ 
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1 GCG7CCGGA7G7T77CAC7777GGGACA7CC7G77C7GAG7CAAGA77CC7CC77C7GAA 

51 CATGGGACTTTCCAGAAGGACCACAGCTCCTCCCGTGCATCCACTCGGCCTGGGAGGTTC 

121 7GGA77T7GGCTG7CGAGGGAG777GCC7GCCTC7CCAGAGAAAGA7GG7CATGAGGCCC 
1 M V M R F 

131 C7G7GGAG7C7GC7TCTCTGGGAAGCCC7AC77CCCA77ACAGTTACTGG7GCCCAAG7G 
6LWSLLLWEALLPITVTGAQV 

241 C7GAGCAAAG7CGGGGGCTCGGTGCTGCTGG7GGCAGCGCGTCCCCCTGGCTTCCAAGTC 
26LSKVGGSVLLVAARPPGFQV 

3 0 1 CGTGAGGC7ATCTGGCGATC7CTCTGGCCTTCAGAAGAGCTCC7GGCCACGTTTTTCCGA 
46REAXWRSLWPSEELLA7FFR 

361 GGC7CCC7GGAGACTCTGTACCATTCCCGCTTCCTGGGCCGAGCCCAGC7ACACAGCAAC 
66GSLE7LYHSRFLGRAQLHSN 

421 CTCAGCC7GGAGCTCGGGCCGC7GGAG7C7GGAGACAGCGGCAAC7TC7CCGTG77GA7G 
86 1SLELGPLESGDSGNFSVLM 

481 G7GGACACAAGGGGCCAGCCC7GGACCCAGACCC7CCAGC7CAAGG7G7ACGA7GCAG7G 
106VD7RGQPW7Q7 LQLKVYDAV 

541 CCCAGGCCCG7GG7ACAAG7G7TCAT7GC7GTAGAAAGGGA7GC7CAGCCC7CCAAGACC 
126 PR-PVVQVF IAVERDAQ P SKT 

601 TGCCAGG7T77CTTGTCC7G7TGGGCCCCCAACA7CAGCGAAA7AACC7A7AGC7GGCGA 
146 CQVFLSCWAPNISEITYSWR 

661 CGGGAGACAACCATGGACTTTGGTATGGAACCACACAGCCTCTTCACAGACGGACAGGTG 
166RE77M0FGMEPHSLF7DGQV 

721 C7GAGC ATTTCCCTGGGACCAGGAGACAGAGA7GTGGCCTATTCCTGCATTGTCTCC AAC 
186 1SISLGPGDRDVAYSC IVSN 

781 CCTG7CAGC7GGGAC7TGGCCACAGTCACGCCCTGGGATAGCTGTCATCA7GAGGCAGCA 
206 PVSWDLA7V7PWDSCHHEAA 

841 CCAGGGAAGGCC7CC7ACAAAGA7G7GC7GC7GG7GG7GG7GCCTG7C7CGC7GC7CC7G 
226 PGKAS VKDVLLVVVPVSCLL 

901 ATGCTGGTTACTCTCTTCTCTGCCTGGCACTGG7GCCCCTGCTCAGGGCCCCACCTCAGA 
246 M LV7LFSAWHWC PCSG PHLR 

9 6 1 7CAAAGCAGC7C7GGA7GAGA7GGGACC7GCAGC7C7CCC7CCCCAAGG7GAC7C77AGC 
266 SKQLWMRWDLQLSLPKV7LS 

1021 AACC7C AT7TCGACAGTGGTTTGTAGCG7GGTGCACCAGGGCC7TGTTGAAC AGATCCAC 
286 :>TL*rSTVVCSVVHQGLVEQIH 

1081 ACC7CC7CTAATAAAGTTCCCA 

3 06 7 C 5 M K 7 P 
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10 20 30 40 
TAM60 fi5 MVMRPLWSL LLWEALLP ITVTGAQ VLSKVGGSVLLV AARPPGF 

5 L A tA tfDPKGI^SLTFVLFLSU^GASYGTGGRMMNCPK^ I 
10 20 30 40 50 60 

50 60 70 30 90 100 

QVREAIWRSLWPSEELIATFFRGSI^TLYHSRFLG-RAQLHSNI^LELGPI^SGDSGNFS 

HIVVTMAKSLENSVENKIVSLDPSEAG - - PPRYLGDRYKFVLE -NLTLGIRESRKEDEGW 
70 80 90 100 110 

110 120 130 140 ISO 160 

VLMVDTRGO PWTQT - - LQLKVYDA VP RP WQ VF IAVERDAQP S KTCQ VFLSCWAPN I S E I 

YLMTLEKNVS - VQRFCLQLRL YEQVSTPE I KVL NKTQENGTCTLILGCTVEKGDHV 

120 130 140 ISO 160 170 

170 180 190 200 210 220 

TYSWRRETTMDFGMEPHSLFTDGQVLSISMPGDRDVAYSCIVSNPVSWDIATVTPWDSC 
till i • * • • •••*•••••••■ • • • •••••••• » , ••• •■ 

AYSWS-EKA-- -GTHPLNPANSSHLLSLTLGPQHADNIYICTVSNPISNNSQTFSPWPGC 
180 190 200 210 220 

230 240 250 260 270 

HHEAAPGKASYKDVLLVWPVSLLLMLVTL - FS AWHWCPCSGPHLRS KQLWMRWDLQLS L 

RTDPSETKPWAVYAGLLGGVIMILIMWILQLRRRGKTNHYQTTVEKKSLTIYAQVQKPG 
230 240 250 260 270 280 

280 290 300 310 

P - KVTLSNLISTWCSWHQGLVEQIHTC SNKVPX 

PLQKICUJSFPAQDPCTTIYVAATEPVPESVQETNSITVYASVTLPES 
290 300 310 320 330 
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Score: 7.10 Seq: 71 128 Model: 6 47 

•r.rcmVsfhPcdYtlwWY. . . rNaaoi tLtlnsWq 

f AKXjO \R) "1 LPFMGSITOLSDVQ--WYQQPSNGDPI^CIS«SYPHIICOKCTLKrLTPG 117 

y EDsG t VwQnV • 

118 VNNSGSYICRP 128 


Score: 12.19 Seq: 168 223 Model; 1 47 

•GqsVTLTC&Vs . . f hPpdYt . IwWYrNaaoi cLtlnsWavEDs 

iOi 168 GSTGSISCPSLSCQSDAOSPAVTWVKNCXLLSVERSNRIVVDEVYDYHO 216 

GcYwOdV 

217 GTWCDY 223 


Score: 4.61 Seq: 266 339 


GqsVTLTCmVs . . fhPudYC . IwWYrNaaoi 

66 GKPLTISCKARfGFERVFNPVIKWYrKDSCLrWEVSVPE^SIKSTtKD 3 14 


tLClnsWovEDs .CtYwQnV* 


315 EI 


IERNIILTKVTORDLKAKFVCFV 339 
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Input file T19SAtmue9fll; Output File T195Atmue9f 11 -pat 
Sequence length 1603 

MWSLWSLLL 9 
CCCACCCGTCCGCCCACGCGTCCGCAAGGGAGGACAACGGCC ATG TGG TCC CTC TGG AGT CTT CTT CTC 6 9 

FEALLPV VVVSVQVLSKV GD 29 
TTT GAA GCT CTC CTT CCC GTT GTG GTT GTC AGT GTC CAA GTG CTA AGC AAG GTA GGG GAC 12 9 

SELL VAECPPGFQVREAIWR 49 
TCA GAG CTG CTG GTG GCC GAG TGT CCT CCG GGC TTC CAA GTG CGT GAG GCT ATC TGG CGA 18 9 

SLWPS EELLATFFRGSLETL 69 
TCT CTG TGG CCA TCG GAG GAG CTC CTG GCC ACA TTT TTC CGA GGT TCC TTG GAG ACT CTG 24 9 

yhsRFLiGRVQLYDNIjSLELG 89 
TAC CAC TCT CGT TTC CTG GGC CGA GTC CAG CTA TAT GAC AAC CTC AGC CTG GAG CTT GGA 30 9 

LKPGDS GNFSV LMVDTRGQ109 
CCC CTG AAA CCT GGA GAC AGC GGC AAT TTC TCT GTG CTG ATG GTG GAT ACA AGG GGT CAA 36 9 

TWTQTL YLKVYDAVPKPEVQ129 
ACC TGG ACC CAG ACC CTG TAT CTC AAG GTG TAC GAT GCA GTA CCC AAG CCC GAG GTT CAA 42 9 

VF TAA AEETQPLNTCQVFLS149 
GTG TTC ACT GCT GCA GCA GAG GAG ACC CAA CCC CTC AAT ACC TGT CAG GTC TTC TTG TCC 48 9 

CWAPN ISDITYSWRREGTVD169 
TGC TGG GCC CCC AAC ATC AGT GAC ATA ACC TAC AGC TGG CGA CGG GAG GGG ACA GTG GAC 54 9 

FNGE VHSHFSNGQVLSVSLG189 
TTC AAT GGT GAA GTG CAC AGC CAT TTC TCA AAT GGA CAG GTG TTA AGT GTC TCA CTG GGA 609 

DKDV AFTCIASNPVSWDM209 
CTG GGG GAC AAG GAT GTG CCC TTT ACC TGC ATT GCC TCC AAT CCT GTC AGC TGG GAT ATG 66 9 
TTVTP WESCHHEAASGKASY229 
ACC ACA GTC ACC CCC TGG GAG AGC TGC CAT CAC GAG GCA GCC TCC GGG AAG GCC TCC TAC 72 9 

KDVLL VVVPITLFLILAGLF249 
AAG GAC GTG CTA CTG GTA GTA GTG CCA ATT ACA CTG TTC CTG ATC CTG GCT GGT CTC TTT 78 9 

GA WHHGLCSGKKKDACTDGV269 
GGG GCA TGG CAC CAT GGC CTC TGC TCA GGG AAG AAG AAG GAT GCT TGC ACT GAC GGG GTG 84 9 

LPETENALV* 279 
CTT CCA GAG ACA GAG AAT GCC CTC GTA TAG 

AGGATGTCATGAGGGACAACATAAACTGGTGCTTGGACCATGATGAGATGCCCTGCTCAGCCACGTGATGCTCCACACC 958 

TGGACACCTCCAGCATCCTCATAACTGCCACAAGTCGCCCTGCCTCTAGCGGACAGCCAAGAAAACCACCATCCTGGAA 1037 

CGTCACCCTGCCCCAAACTTCCTCCTTCCTCCATCCTGTTCGCACATGCCGGATCCTCTCTGGGCAAGGTGAACTAGTA 1116 
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GGATGCTTCCTTCAGAACACAGGACTTTCTCTAGGATCCACAGAGACATTGATTATCCAAGGCATCCATTCTTCTATCA 1195 

CTGTACATAAGGTCTTGCCCAACAGCCACCAAGGGACGGCCTCCAGGCCAGGACCTTGGCTCAAAGAGAGATGAGATGT 12 74 

TTGAACTAACATGGAAATTG AGCTAACCATTGCCAACTCCAACTCCAGCCCCTGGGGTCTGAGTTCCTTGTGTTCAAGA 13 53 

TGTTATTATAAGAAAAGGCAAAG AACAGGAAATGATGGAGGGTGGGGCATTCTCTTTCTGGTCTGAAGGACTTTAAGAT 14 32 

TATCTGAGTTCAAGGGCCATCAAAAGTAAATTGAGATTACAGATGATGAGGGGTTGGTAGCTAATGTGCCATGTTGGGA 1511 

TCAAAGCCATTTTTCTGGTAGCACTATATTAATAGACACCTTTGTTGCCATTTAAAAAAAAAAAAAAAAAAAAAA 15 90 

AAAAAAAAAAAAA 1603 
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Input file T195AChpb93fl; Output File T195Athpb93 f 1 . pat 
Sequence length 1118 

GATGTTTTCACTTTTGGGACATCCTGTTCTGAGTCAAGATTCCTCCTTCTGAACATGGGACTTTCCAGAAGGACCACAG 79 

CTCCTCCCGTGCATCCACTCGGCCTGGGAGGTTCTGGATTTTGGCTGTCGAGGGAG7TTGCCTGCCTCTCCAGAGAAAG 156 

MVMRPLWSLLLWEALLFITV 20 
ATG GTC ATG AGG CCC CTG TGG AGT CTG CTT CTC TGG GAA GCC CTA CTT CCC ATT ACA GTT 218 

TGA QVLSKVGGSVLLVAARP 40 
ACT GGT GCC CAA GTG CTG AGC AAA GTC GGG GGC TCG GTG CTG CTG GTG GCA GCG CGT CCC 278 

PGFQVR. EAIWRSLWPSEELL 60 
CCT GGC TTC CAA GTC CGT GAG GCT ATC TGG CGA TCT CTC TGG CCT TCA GAA GAG CTC CTG 3 38 

ATFF RGSLETLYHSRFLGRA 80 
GCC ACG TTT TTC CGA GGC TCC CTG GAG ACT CTG TAC CAT TCC CGC TTC CTG GGC CGA GCC 3 98 

QLHSN LSLELGPLESGDSGN 100 
CAG CTA CAC AGC AAC CTC AGC CTG GAG CTC GGG CCG CTG GAG TCT GGA GAC AGC GGC AAC 4 58 

FS VLMVDTRGQPWTQTLQLK120 
TTC TCC GTG TTG ATG GTG GAC ACA AGG GGC CAG CCC TGG ACC CAG ACC CTC CAG CTC AAG 518 

VYDA VPRPVVQVFIAVERDA140 
GTG TAC GAT GCA GTG CCC AGG CCC GTG GTA CAA GTG TTC ATT GCT GTA GAA AGG GAT GCT 578 

0 p S KTCQVFLSCWAPNISEI 160 
CAG CCC TCC AAG ACC TGC CAG GTT TTC TTG TCC TGT TGG GCC CCC AAC ATC AGC GAA ATA 638 

- TyswR KETTMDFGMEPHSLF180 
ACC TAT AGC TGG CGA CGG GAG ACA ACC ATG GAC TTT GGT ATG GAA CCA CAC AGC CTC TTC 698 

TDGQ VLSISLGPGDRDVAYS200 
ACA GAC GGA CAG GTG CTG AGC ATT TCC CTG GGA CCA GGA GAC AGA GAT GTG GCC TAT TCC 7 58 

CIVSNP VSWDLATVTPWDSC220 
TGC ATT GTC TCC AAC CCT GTC AGC TGG GAC TTG GCC ACA GTC ACG CCC TGG GAT AGC TGT 818 

HHEAAPGKASYKDVLLVVVP240 
CAT CAT GAG GCA GCA CCA GGG AAG GCC TCC TAC AAA GAT GTG CTG CTG GTG GTG GTG CCT 8 78 

VSLLLMLVTLFSAWHWCPCS260 
GTC TCG CTG CTC CTG ATG CTG GTT ACT CTC TTC TCT GCC TGG CAC TGG TGC CCC TGC TCA 938 

■ GPHLRSKQLWM.-RWDLQLSLH. 280 
GGG CCC CAC CTC AGA TCA AAG CAG CTC TGG ATG AGA TGG GAC CTG CAG CTC TCC CTC CAC 998 

KVTLS NLISTVVCSVVHQGL300 
AAG GTG ACT CTT AGC AAC CTC ATT TCG ACA GTG GTT TGT AGC GTG GTG CAC CAG GGC CTT 10 5 8 

V£OIH TALIKFPSLMKKKKK320 


GTT GAA CAG ATC CAC ACT GCT CTA ATA AAG TTC CCA TCC TTA ATG AAA AAA AAA AAA AAA 
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input file T195AthLal70flO; Output File T195AthLal70f 10 .pat 
Sequence length 28 94 

M VMRPLWSLLLWE 13 
CCCACGCGTCCGCTCCAGAGAAAG ATG GTC ATG AGG CCC CTG TGG ACT CTG CTT CTC TGG GAA 63 

ALLPI TVTGAQVLSKVGGSV 33 
GCC CTA CTT CCC ATT ACA GTT ACT GC-T GCC CAA GTG CTG AGC AAA GTC GGG GGC TCG GTG 123 

LLVAARPPGFQVREAIWRSL 53 
CTG CTG GTG GCA GCG CGT CCC CCT GGC TTC CAA GTC CGT GAG GCT ATC TGG CGA TCT CTC 18 3 

w pSEELLATFFRGSLETLYH 73 
TGG CCT TCA GAA GAG CTC CTG GCC ACG TTT TTC CGA GGC TCC CTG GAG ACT CTG TAC CAT 24 3 

SRFLGRAQLHSNLSLELGPL 93 
TCC CGC TTC CTG GGC CGA GCC CAG CTA CAC AGC AAC CTC AGC CTG GAG CTC GGG CCG CTG 303 

ESGD SSNFSVLMVDTRGQPW113 
GAG TCT GGA GAC AGC AGC AAC TTC TCC GTG TTG ATG GTG GAC ACA AGG GGC CAG CCC TGG 363 

TOTLQLKVYDA VPR. PVVQVF133 
ACC CAG ACC CTC CAG CTC AAG GTG TAC GAT GCA GTG CCC AGG CCC GTG GTA CAA GTG TTC 423 

I AV ERDAQPSKTCQ. VFLSCW153 
ATT GCT GTA GAA AGG GAT GCT CAG CCC TCC AAG ACC TGC CAG GTT TTC TTG TCC TGT TGG 48 3 

APNIS EITYSWRRETTMDFG173 
GCC CCC AAC ATC AGC GAA ATA ACC TAT AGC TGG CGA CGG GAG ACA ACC ATG GAC TTT GGT 54 3 

mp PHS L ftDGQVLS I S LG PG 193 
ATG GAA CCA CAC AGC CTC TTC ACA GAC GGA CAG GTG CTG AGC ATT TCC CTG GGA CCA GGA 603 

DRDV AYSCIVSNPVSWDLAT213 
GAC AGA GAT GTG GCC TAT TCC TGC ATT GTC TCC AAC CCT GTC AGC TGG GAC TTG GCC ACA 663 

VTPWD SCHHEAAPGKASYK D233 
GTC ACG CCC TCG GAT AGC TGT CAT CAT GAG GCA GCA CCA GCG AAG GCC TCC TAC AAA GAT 72 3 

VLLV VVPV SLLLMLVTLF SA2S3 
GTG CTG CTG GTG GTG GTG CCT GTC TCG CTG CTC CTG ATG CTG GTT ACT CTC TTC TCT GCC 783 

WHWC PCSGKKKKDVHADRVG273 
TGG CAC TGG TGC CCC TGC TCA GGG AAA AAG AAA AAG GAT GTC CAT GCT GAC AGA GTG GGT 84 3 


PETENPLVQDLP* 
CCA GAG ACA GAG AAC CCC CTT GTG CAG GAT CTG CCA TAA 

AGGACAATATGAACTGATGCCTGGACTATCAGTAACCCCACTGCACAGGCACACGATGCTCTGGGACATAACTGGTGCC 
TGGAAATCACCATGGTCCTCATATCTCCCATGGGAATCCTGTCCTGCCTCGAAGGAGCAGCCTGGGCAGCCATCACACC 
ACGAGGACAGGAAGCACCAGCACGTTTCACACCTCCCCCTTCCCTCTCCCATCTTCTCATATCCTGGCTCTTCTCTGCG 


286 
882 

961 

1040 

1119 
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CAAGATGAGCCAAGCAGAACATTCCATCCAGGACACTGGAAGTTCTCCAGGATCCAGATCCATGGGGACATTAATAGTC 1198 

CAAGGCATTCCCTCCCCCACCACTATTCATAAAGTATTAACCAACTGGCACCAAGGAATTGCCTCCAGCCTGAGTCCTA 1277 

GGCTCTAAAAGATATTACATATTTGAACTAATAGAGGAACTCTGAGTCACCCATGCCAGCATCAGCTTCAGCCCCAGAC 13 36 

CCTGCAGTTTGAGATCTGATGCTTCCTGAGGGCCAAGGCATTGCTGTAAGAAAAGGTCTAGAAATAGGTGAAAGTGAGA 14 35 

GGTGGGGGACAGGGGTTTCTCTTTCTGGCCTAAGGACTTTCAGGTAATCAGAGTTCATGGGCCCTCAAAGGTAAATTGC 1514 

AGTTGTAGACACCGAGGATGGTTGACAACCCATGGTTGAGATGGGCACCGTTTTGCAGGAAACACCATATTAATAGACA 1593 

TCCTCACCATCTCCATCCGCTCTCACGCCTCCTGCAGGATCTGGGAGTGAGGGTGGAGAGTCTTTCCTCACGCTCCAGC 1672 

ACAGTGGCCAGGAAAAGAAATACTGAATTTGCCCCAGCCAACAGGACGTTCTTGCACAACTTCAAGAAAAGCAGCTCAG 1751 

CTCAGGATGAGTCTTCCTGCCTGAAACTGAGAGAGTGAAGAACCATAAAACGCTATGCAGAAGGAACATTATGGAGAGA 183 0 

AAGGGTACTGAGGCACTCTAGAATCTGCCACATTCATTTTCAAATGCAAATGCAGAAGACTTACCTTAGTTCAAGGGGA 1909 

GGGGACAAAGACCCCACAGCCCAACAGCAGGACTGTAGAGGTCACTCTGACTCCATCAAACTTTTTATTGTGGCCATCT 1988 

TAGGAAAATACATTCTGCCCCTGAATGATTCTGTCTAGAAAAGCTCTGGAGTATTGATCACTACTGGAAAAACACTTAA 2067 

GGAGCTAAACTTACCTTCGGGGATTATTAGCTGATAAGGTTCACAGT^TrCTCTCACCCAGGTGTAACTGGATTTT^ 2146 

GGGGCCTCAATCCAGTCTTGATAACAGCGAGGAAAGAGGTATTGAAGAAACAGGGCTGGGTTTGAAGTACTATTTTCCC 2225 

AGGGTGGCTTCAATCTCCCCACCTAGGATGTCAGCCCTGTCCAAGGACCTTCCCTCTTCTCCCCAGTTCCTGGGCAATC 2304 

ACTTCACCTTGGACAAAGGATCAGCACAGCTGGCCTCCAGATCCACATCACCACTCTTCCACTCGATTGTTCCCAGATC 2 3 83 

CTCCCTGCCTGGCCTGCTCAGAGGTTCCCTGTTGGTAACCTGGCTTTATCAAATTCTCATCCCTTTCCCACACCCACTT 2462 

CTCTCCTATCACCTTCCCCCAAGATTACCTGAACAGGGTCCATGGCCACTCAACCTGTCAGCTTGCACCATCCCCACCT 2 541 

GCCACCTACAGTCAGGCCACATGCCTGGTCACTGAATCATGCAAAACTGGCCTCAGTCCCTAAAAATGATGTGGAAAGG 2620 

AAAGCCCAGGATCTGACAATGAGCCCTGGTGGATTTGTGGCK3AAAAAATACACAGCACTCCCCACCTTTCTTTCGTTCA 2 699 

TCTCCAGGGCCCCACCTCAGATCAAAGCAGCTCTGGATGAGATGGGACCTGCAGCTCTCCCTCCACAAGGTGACTCTTA 2778 

GCAACCTCATTTCGACAGTGGTTTGTAGCGTGGTGCACCAGGGCCTTGTTGAACAGATCCACACTGCTCTAATAAAGTT 2857 

CCCATCCTTAATGAAAAAAAAAAAAAAAAAAAAAAAA 2894 
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